Abstract

We study SU(N) plane-wave matrix theory up to fourth perturbative order in its large
N planar limit. The effective Hamiltonian in the closed su(2) subsector of the model 
is explicitly computed through a specially tailored computer program to perform large 
scale distributed symbolic algebra and generation of planar graphs. The number of 
graphs here was in the deep billions. 

The outcome of our computation establishes the four-loop integrability of the planar 
plane-wave matrix model. To elucidate the integrable structure we apply the recent 
technology of the perturbative asymptotic Bethe ansatz to our model. The resulting 
S-matrix turns out to be structurally similar to, but nevertheless distinct from, the
long-range spin-chain S-matrices considered so far in the AdS/CFT context, namely those
of Inozemtsev, Beisert-Dippel-Staudacher and Arutyunov-Frolov-Staudacher.

In particular our result displays a breakdown of BMN scaling at the four-loop order. 
That is, while there exists an appropriate identification of the matrix theory mass 
parameter with the coupling constant of the N = 4 superconformal Yang-Mills theory
which yields an eighth order lattice derivative for well separated impurities (naively 
implying BMN scaling) the detailed impurity contact interactions ruin this scaling 
property at the four-loop order. 

Moreover we study the issue of "wrapping" interactions, which show up for the first 
time at this loop-order through a Konishi descendant length four operator. 
1 Introduction



In this work we report on a high order perturbative study of plane-wave matrix theory
in its large N limit. This theory is a supersymmetric, mass deformed SU(N) gauge quantum
mechanical model intricately related to maximally supersymmetric four dimensional
Yang-Mills theory through a consistent reduction on $\mathbb{R} \times S^3$ [2]. The study
contains quite involved technical parts, in particular on the side of realizing this
computation through a specially tailored computer program designed by one of the authors
(TF) to perform large scale distributed symbolic algebra for the computation of planar
diagrams. Readers interested mostly in the computer science aspects of this computation
are advised to proceed directly to App. A, which is largely self contained.

The motivation for performing this computation grew out of the mounting evidence for 
integrability of planar N = 4 super Yang-Mills theory (SYM), which should be of prime
importance in the AdS/CFT correspondence. In a nutshell it states that the dilatation 
operator of the planar gauge theory, which yields the anomalous dimensions of local com- 
posite operators in the conformal quantum field theory, is given by a long-range integrable 
spin chain Hamiltonian [3J |IJ |SJ H3 IE].¹ This statement is by now firm at the one-loop
level for the full theory in form of a (non-compact) su(2,2|4) super spin-chain [5] and has
been extended to the three-loop level in a closed supersymmetric su(2|3) subsector. In
this picture the loop order of the considered dilatation operator is linked to the spread of 
the local spin interactions of the spin-chain Hamiltonian. In the smallest compact closed 
subsector of the N = 4 gauge theory, the su(2) sector, the l-loop order contribution
corresponds to an s = 1/2 spin-chain Hamiltonian of Heisenberg type involving the interactions
of l + 1 neighboring spins [I]. Next to integrability a key property of the established form
of the dilatation operator is its perturbative "BMN scaling", which is also predicted from
the dual plane-wave superstring pQ. This scaling property concerns the form of the l-th loop
contribution to the energy eigenvalue (or anomalous scaling dimension) in the thermodynamic
limit of large spin-chains: its leading contribution scales as $1/L^{2l}$, where L is the
spin-chain length.

In [2] one of the few known long range integrable spin-chain models of statistical me- 
chanics due to Inozemtsev ^U] was identified as being able to provide the established form 
of the super Yang-Mills dilatation operator up to the three-loop level. This is of course very
exciting, as this model could thus provide us with exact non-perturbative information on
the N = 4 gauge theory. However, as was explicitly demonstrated in 0, the Inozemtsev
chain predicts a breakdown of the BMN scaling property at four-loop order. We wish
to stress that such a breakdown of perturbative BMN scaling is not necessarily in conflict
with the AdS/CFT correspondence, despite the BMN scaling property of the dual plane-wave
superstring spectrum. A similar breakdown is by now firmly established in the "near"
plane-wave superstring and in the matching of energies of rotating solitonic $AdS_5 \times S^5$
string solutions ^2] at three-loop order in perturbative gauge theory. This discrepancy has
been attributed to an order of limits problem, stating that a perturbative agreement should 
not be expected jH]. Hence, based on these results there presently is no reason to expect 

1 For a recent review see UJ. 
perturbative BMN scaling in N = 4 super Yang-Mills!²

Nevertheless the assumption of perturbative BMN scaling is a very fruitful one. Indeed 
Beisert, Dippel and Staudacher [H] have argued that the su(2) dilatation operator is com- 
pletely determined up to the five loop level once one assumes integrability, BMN scaling 
and structural constraints from the underlying Feynman diagrammatic structure. These 
authors went on to conjecture an intriguingly simple all loop (asymptotic) Bethe ansatz, 
capable of reproducing the eigenvalues of the explicitly deduced dilatation operator up to 
five loops. Inspired by this structure a similar but distinct asymptotic Bethe ansatz for
the full quantum $AdS_5 \times S^5$ string in the su(2) subsector was conjectured by Arutyunov,
Frolov and Staudacher in [Tl|. This ansatz accounts for the above mentioned three-loop
discrepancies and remarkably reproduces all presently known data of the string side of the
correspondence: in its thermodynamic limit it reduces to the Bethe equations of the classical
spinning string sigma model of JH]; it reproduces the near plane-wave corrections to
the quantum string spectrum; and it leads to the expected $\lambda^{1/4}$ behaviour of
the spectrum at strong coupling. The associated spin-chain Hamiltonian to the first few
orders in a small $\lambda$ expansion was established in [16].

Given these results it would clearly be of central importance to determine the exact form 
of the su(2) dilatation operator at the four-loop level and determine whether it obeys BMN 
scaling and is indeed integrable. This, however, appears to be beyond present capability 
in the perturbative quantum field theory, where the state of the art is at three-loop order 

QUEUE]. 

Our work adds a further piece to this puzzle by considering a reduced (toy) model of 
the full quantum conformal field theory. The scaling dimensions of the gauge theory on $\mathbb{R}^4$
are equivalent to the spectrum of energies in a radial quantization of the model on $\mathbb{R} \times S^3$.
It turns out that one may consistently truncate the Kaluza-Klein spectrum of states of
N = 4 SYM on the three-sphere to obtain a reduced gauge quantum mechanical model
[2]. The resulting reduced model is the plane-wave matrix theory (PWMT), which has
also been proposed as a microscopic definition of M-theory in a maximally supersymmetric
eleven-dimensional plane-wave background.³ Moreover, as was shown in [21], it not only
inherits the planar integrability of the N = 4 gauge theory, but leads to precisely the same
dilatation operator in the closed su(2) subsector up to the three loop level, where the SYM
information is firm. In order to perform this matching one has to establish a relation between 
the coupling constants of the full and the reduced model, which is corrected order by order 
in perturbation theory. In fact, this relation is uniquely determined by the requirement of 
BMN scaling behavior. 

In this paper we push the perturbation theory of PWMT to fourth order. We explicitly 
compute the planar effective Hamiltonian in the su(2) subsector without relying on any 

2 In the literature arguments for the mechanism of an all loop BMN scaling property of N = 4 SYM have
been put forward. In particular our finding of the "naive" BMN scaling property is in line with these
arguments; the breakdown of BMN scaling arises from subtle boundary effects when two impurities come
close to each other, to be discussed in section 4.

3 It also arises as a regularization of the light-cone supermembrane in this background, or alternatively
from the dynamics of D0-branes in type IIA string theory |2U|.
assumptions. The model remains integrable at this loop order but ceases to obey BMN
scaling: the leading behavior of the energy eigenvalues scales as $1/L^7$ after one performs
the above mentioned renormalization of the coupling constant. Indeed the obtained four-loop
contribution is neither of the Inozemtsev nor of the Beisert, Dippel, Staudacher type, while
it clearly agrees with these spin-chains up to the three-loop level.

We elucidate the underlying integrable structure of the system by constructing the as- 
sociated "perturbative asymptotic Bethe ansatz", a novel concept very recently developed 
in [22]. The extracted S-matrix turns out to qualitatively agree with the quantum string
S-matrix of Arutyunov, Frolov and Staudacher [H] but quantitatively disagrees by an addi- 
tional power of the coupling constant in the exponential term involving the higher charges 
- which is responsible for the breakdown of BMN scaling. This finding lends support to the 
suspicion that the generic form of a long-range integrable spin chain S-matrix will involve 
an extra exponential factor with a characteristic dependence on the higher charges of the 
spin-chain, first proposed in ^3] and further developed in [22].

Furthermore we can shed light on the structure of the "wrapping" interactions, which 
are inaccessible in the asymptotic forms of the presently conjectured all loop Bethe ansatze. 
At four-loop order (including five neighboring spin interactions) we have for the first time 
a non-protected operator, which is shorter than the spread of the dilatation operator: a 
Konishi descendant of length four. It turns out that no natural prescription of how to 
act with the five spin terms on this shorter state (wrapping or dropping) reproduces the 
explicitly computed scaling dimension. 

On the computational side, the four-loop study was achieved with a newly developed 
computer program which constructs all planar diagrams of the perturbation expansion 
and performs the corresponding algebra. A change of the platform and the use of heavily 
improved algorithms led to a performance increase by a factor of more than 100 with 
respect to the previous version when comparing the three-loop computation times. Only
this enhancement brought fourth order within reach, but it still required some 88,000 CPU
hours on current computers.

2 Effective Hamiltonian of plane-wave matrix theory 
2.1 The model 

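The Hamiltonian of plane-wave matrix theory can be written as $H = H_0 + V_1 + V_2$ with

\[
\begin{aligned}
H_0 &= \mathrm{tr}\Big[ \tfrac12 P_I P_I + \tfrac12 \big(\tfrac{M}{2}\big)^2 X_a X_a + \tfrac12 M^2 X_i X_i - \tfrac{3M}{8}\, i\, \theta \gamma_{123} \theta \Big] , \\
V_1 &= -M i\, \epsilon_{ijk}\, \mathrm{tr}\, X_i X_j X_k - \mathrm{tr}\, \theta \gamma_I [X_I, \theta] , \\
V_2 &= -\tfrac14\, \mathrm{tr}\, [X_I, X_J][X_I, X_J] .
\end{aligned}
\tag{2.1}
\]
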
The degrees of freedom are: 
field             notation                               mass    modes                         symmetry
light scalars     X_a (a = 1, ..., 6)                    M/2     a_a, a_a^\dagger              SO(6)
heavy scalars     X_i (i = 1, ..., 3)                    M       b_i, b_i^\dagger              SO(3)
Majorana spinor   \theta_\alpha (\alpha = 1, ..., 16)    3M/4    c_\alpha, c_\alpha^\dagger    SO(9)



The "mass parameter" M is in fact dimensionless⁴ and will serve as (inverse) coupling
constant in perturbation theory below.

In (2.1) we used the index I = (a, i) to refer to all scalars at once. All fields $[X_a(t)]_{rs}$
etc. are functions of time and take values in the Lie algebra of the gauge group SU(N), r =
1, ..., N^2 - 1. In the fermionic sector we work in a representation with charge conjugation
matrix equal to unity and with real and symmetric Euclidean Dirac matrices $(\gamma_I)_{\alpha\beta}$.

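For the quantization, we introduce the following creation and annihilation operators

\[
\begin{aligned}
P_a &= -\tfrac{i}{2}\sqrt{M}\,\big(a_a - a_a^\dagger\big) , &
X_a &= \tfrac{1}{\sqrt{M}}\,\big(a_a + a_a^\dagger\big) , \\
P_i &= -i\sqrt{\tfrac{M}{2}}\,\big(b_i - b_i^\dagger\big) , &
X_i &= \tfrac{1}{\sqrt{2M}}\,\big(b_i + b_i^\dagger\big) , \\
\theta_\alpha &= c_\alpha + c_\alpha^\dagger
\quad \text{with} \quad c = \Pi^- \theta , \quad c^\dagger = \Pi^+ \theta ,
\end{aligned}
\tag{2.2}
\]

where $\Pi^\pm := \tfrac12 (1 \pm i\gamma_{123})$. They satisfy the canonical (anti-)commutation relations

\[
\begin{aligned}
[(a_a)_{rs}, (a_b^\dagger)_{tu}] &= \delta_{ab} \big( \delta_{st}\delta_{ru} - \tfrac{1}{N}\delta_{rs}\delta_{tu} \big) , \\
[(b_i)_{rs}, (b_j^\dagger)_{tu}] &= \delta_{ij} \big( \delta_{st}\delta_{ru} - \tfrac{1}{N}\delta_{rs}\delta_{tu} \big) .
\end{aligned}
\tag{2.3}
\]

The free Hamiltonian now reads

\[
H_0 = \mathrm{tr}\Big[ \tfrac{M}{2}\, a_a^\dagger a_a + M\, b_i^\dagger b_i + \tfrac{3M}{4}\, c_\alpha^\dagger c_\alpha \Big] .
\tag{2.4}
\]
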

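Physical states are constrained to be gauge invariant and are given by traces over words in
the creation operators

\[
|\psi\rangle = \mathrm{tr}\big(a^\dagger_{a_1} b^\dagger_{i_2} a^\dagger_{a_3} \cdots\big)\, \mathrm{tr}(\cdots) \cdots |0\rangle .
\tag{2.5}
\]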



In the planar limit we concentrate on single trace states. The number of oscillators in 
the trace is called the length of a state. It is common to visualize a state of length L 
as a "spin-chain" with L sites, where every spin orientation corresponds to a particular 
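oscillator. In the case of the full PWMT, the spins are elements of the 17-dimensional
module span$\{a_a^\dagger, b_i^\dagger, c_\alpha^\dagger\}$ of su(4|2). In the following, however, we will be interested only in
states that are built from fields of a certain su(2) subsector. In order to define this subsector
it is convenient to relabel the so(6) fields according to

\[
Z^\dagger = \tfrac{1}{\sqrt2}\big(a_1^\dagger + i a_2^\dagger\big) , \quad
\bar Z^\dagger = \tfrac{1}{\sqrt2}\big(a_1^\dagger - i a_2^\dagger\big) , \quad
Z = \tfrac{1}{\sqrt2}\big(a_1 - i a_2\big) , \quad
\bar Z = \tfrac{1}{\sqrt2}\big(a_1 + i a_2\big)
\tag{2.6}
\]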



4 In terms of M-theoretic quantities it is given by M = where $\mu$ is the parameter of the plane-wave
background, $l_P$ is the eleven-dimensional Planck length and R is the radius of the compactified eleventh
dimension.
and analogously for $W^\dagger = \tfrac{1}{\sqrt2}(a_3^\dagger + i a_4^\dagger)$ and $Y^\dagger = \tfrac{1}{\sqrt2}(a_5^\dagger + i a_6^\dagger)$. The only non-vanishing
commutators are

\[
[(Z)_{rs}, (Z^\dagger)_{tu}] = [(\bar Z)_{rs}, (\bar Z^\dagger)_{tu}] = \delta_{st}\delta_{ru} - \tfrac{1}{N}\delta_{rs}\delta_{tu}
\tag{2.7}
\]

and identical formulas for $W^\dagger$ and $Y^\dagger$. The mentioned su(2) subsector is spanned by $Z^\dagger$
and $W^\dagger$. A typical state is given by

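\[
\mathrm{tr}\big(Z^\dagger Z^\dagger W^\dagger Z^\dagger Z^\dagger Z^\dagger W^\dagger Z^\dagger \cdots\big)\, |0\rangle = |\uparrow\uparrow\downarrow\uparrow\uparrow\uparrow\downarrow\uparrow \cdots\rangle .
\tag{2.8}
\]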

It is customary to call $W^\dagger$ an "impurity" within the "background fields" $Z^\dagger$. In the
spin-chain language $W^\dagger$ refers to the excitation of a "magnon". Both fields can be distinguished
by a U(1)_R charge J associated to rotations in the 1-2-plane of the SO(6) field space:



field    Z^\dagger    \bar Z^\dagger    Z     \bar Z    others
J        +1           -1                -1    +1        0






The R-charge is additive and hence J counts the number of $Z^\dagger$'s in an su(2) state.

Later on we will need the parity conjugate of (single-trace) states, which is defined as
the state with reversed order of fields multiplied by $(-1)^{\mathrm{length}}$.

2.2 Perturbation theory to fourth order 

The aim is to construct the effective Hamiltonian T (sometimes referred to as energy oper- 
ator) for the matrix model. The effective Hamiltonian is defined as an operator similar to 
the full Hamiltonian, $T = U^{-1} H U$, which does not mix states with different free energies.
It is the matrix theory operator that corresponds to the dilatation operator of SYM. 

The computation of the effective Hamiltonian T has to be done in perturbation theory 
for large values of M. Note that the interaction terms $V_1$ and $V_2$ in (2.1) are suppressed by
powers of 1/M 2 and 1/M 4 , respectively, as can be seen by rescaling the fields. In |21| T was 
derived up to three-loop order. For convenience we repeat the formulas of the perturbation 
expansion of T, now including fourth order, in App. B. We have intentionally refrained from
choosing the transformation operator U to be unitary. Though this yields a non-hermitian
T, it has the essential advantage of producing far fewer terms that need to be computed.
This is important because fourth order is very close to the borderline of the technically
infeasible. To perform the actual computation we have developed the highly optimized
computer program gemstone, which is described in detail in App. A.

While at third order it was still possible to find the non-planar effective Hamiltonian
[21], at fourth order this is impossible with today's technology. In any case, our interest
concerns only the planar part of T, because integrability is only a feature of the
planar limit. Also, the connection between PWMT and SYM at the non-planar level has
already ceased to exist at three loops.

Note, however, that the formulas for T as given in (B.1) will produce any graph, in
particular non-planar ones. It would be extremely nice to have a formalism that just builds
planar graphs automatically. In cases where there are no pure creation and pure annihilation
vertices, this can indeed be achieved by replacing the matrix oscillators by scalar oscillators
and their algebra (2.3) by the Cuntz algebra [23]. But the interaction terms of PWMT
unfortunately do not meet this requirement.

Therefore the implementation of the perturbation expansion on the computer has to
be supplemented by a special algorithm which guarantees the construction of all planar
graphs and only of those. Simple extensions of the Cuntz algebra approach suffer from
overcounting or an omission of some planar graphs. Our algorithm (which introduces structures
like "graphs", "components of connection", "domains" etc.) is explained in App. A.3.

There is another general problem of taking the planar limit. When just looking at an 
operator, as opposed to an operator applied to a state, it is in general not possible to 
extract its planar part. This is because a superficially non-planar looking operator can in 
fact produce a planar graph when it is applied to a sufficiently short state. This issue is 
known as the wrapping problem.

The resolution of this problem is to split the computation in pieces: a general compu- 
tation under the assumption that the states are long enough to avoid wrapping, and then 
as many special cases as there are lengths which are not covered by the general part. At 
four-loop order wrapping can only occur for states of length less than or equal to four. 
These cases will be considered in Sec. 2.4 below. For the time being let us assume that the
effective Hamiltonian acts only onto states of length greater than four. 

A convenient notation for the planar operators was introduced in [I]: Let $P_{k_1,k_2}$ exchange
the oscillators at sites $k_1$ (mod L) and $k_2$ (mod L) of a state of length L. Then any planar
operator that preserves the number of individual flavours, such as the effective Hamiltonian
in the su(2) subsector, can be written as a linear combination of objects of the following
kind:

\[
\{n_1, n_2, \ldots\} := \sum_{k=1}^{L} P_{k+n_1,k+n_1+1}\, P_{k+n_2,k+n_2+1} \cdots , \qquad \{\} := L .
\tag{2.9}
\]

There are many trivial relations for these operators such as

\[
\begin{aligned}
\{\ldots, n, n, \ldots\} &= \{\ldots, \ldots\} , \\
\{\ldots, n, m, \ldots\} &= \{\ldots, m, n, \ldots\} \quad \text{for } |n - m| \ge 2 , \\
\{n_1, n_2, \ldots\} &= \{n_1 + m, n_2 + m, \ldots\} ,
\end{aligned}
\tag{2.10}
\]

as well as the identity

\[
\begin{aligned}
& \{\ldots, \ldots\} + \{\ldots, n \pm 1, n, \ldots\} + \{\ldots, n, n \pm 1, \ldots\} \\
& \quad - \{\ldots, n, \ldots\} - \{\ldots, n \pm 1, \ldots\} - \{\ldots, n, n \pm 1, n, \ldots\} = 0 ,
\end{aligned}
\tag{2.11}
\]

which only holds in the su(2) subsector and expresses the fact that two spins cannot be
placed completely antisymmetrically onto three sites.
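
The permutation symbols of (2.9) are easy to experiment with numerically. The following Python sketch is an illustration only and is not taken from the gemstone program: basis states are represented as tuples of site labels, and the helper names `brace` and `combine` are hypothetical. The convention that the rightmost transposition acts first is an assumption; it does not affect the checks shown. The sketch applies the antisymmetrizer identity quoted above to all two-flavour states of length five and to one three-flavour state.

```python
from collections import Counter
from itertools import product


def brace(ns, state):
    """Apply {n1, n2, ...} = sum_k P_{k+n1,k+n1+1} P_{k+n2,k+n2+1} ... to a basis state.

    `state` is a tuple of site labels such as ('Z', 'W', ...); sites are taken mod L.
    Returns a Counter mapping basis states to integer coefficients.
    The empty symbol {} returns L times the input state.
    """
    L = len(state)
    out = Counter()
    for k in range(L):
        s = list(state)
        # Assumed convention: the rightmost transposition acts on the state first.
        for n in reversed(ns):
            i, j = (k + n) % L, (k + n + 1) % L
            s[i], s[j] = s[j], s[i]
        out[tuple(s)] += 1
    return out


def combine(terms, state):
    """Apply a linear combination [(coefficient, symbol), ...] and drop zero entries."""
    total = Counter()
    for coeff, ns in terms:
        for basis, c in brace(ns, state).items():
            total[basis] += coeff * c
    return {b: c for b, c in total.items() if c != 0}


# The identity with n = 0 and n + 1 = 1, i.e. the antisymmetrizer on three adjacent sites.
IDENTITY = [(1, ()), (1, (1, 0)), (1, (0, 1)), (-1, (0,)), (-1, (1,)), (-1, (0, 1, 0))]

if __name__ == "__main__":
    two_flavour_ok = all(combine(IDENTITY, s) == {} for s in product("ZW", repeat=5))
    print("vanishes on all two-flavour states of length 5:", two_flavour_ok)
    print("vanishes on a three-flavour state:", combine(IDENTITY, tuple("ZWYZZ")) == {})
```

With two flavours every window of three adjacent sites contains a repeated label, so the antisymmetrizer annihilates the state; with a third flavour this argument no longer applies, which is why the identity is special to the su(2) subsector.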

Now, our strategy to determine the planar effective Hamiltonian is the following. We
start from an ansatz for $T_8$ in terms of the permutation operators (2.9) with free coefficients
and apply this ansatz to some particular states. The coefficients are then determined by
applying the whole perturbation expansion (B.8) to the same states using gemstone and
matching the results.

Specifically we use the following ansatz for the planar four-loop effective Hamiltonian

\[
\begin{aligned}
T_8 ={}& c_1\{\} + c_2\{0\} + c_3\{0,1\} + c_4\{1,0\} \\
& + c_5\{0,2\} + c_6\{0,1,2\} + c_7\{2,1,0\} + c_8\{0,2,1\} + c_9\{1,0,2\} + c_{10}\{1,0,2,1\} \\
& + c_{11}\{0,3\} + c_{12}\{0,1,3\} + c_{13}\{0,3,2\} + c_{14}\{1,0,3\} + c_{15}\{0,2,3\} \\
& + c_{16}\{0,1,2,3\} + c_{17}\{3,2,1,0\} + c_{18}\{0,2,1,3\} + c_{19}\{1,0,3,2\} \\
& + c_{20}\{0,1,3,2\} + c_{21}\{0,3,2,1\} + c_{22}\{1,0,2,3\} + c_{23}\{2,1,0,3\} \\
& + c_{24}\{0,2,1,3,2\} + c_{25}\{2,1,0,3,2\} + c_{26}\{1,0,2,1,3\} + c_{27}\{1,0,3,2,1\} \\
& + c_{28}\{1,0,2,1,3,2\} + c_{29}\{2,1,0,3,2,1\} .
\end{aligned}
\tag{2.12}
\]

This ansatz is capable of describing any planar operator which acts locally as permutation on 
five adjacent fields. In fact it is even too general for the effective Hamiltonian which possesses 
some properties that could actually be used to reduce the number of free coefficients. 
However, we are not going to make use of these properties in order to have a consistency 
check for our computation. The price for this is just a marginally increased computation 
time. 

Firstly we could discard all terms which consist of more than 4 elementary permutations.
This is because any transposition corresponds to a four-point vertex in the language of
Feynman diagrams, and at four-loop order there are at most four of these vertices.⁵ Hence,
one could set

\[
c_{24} = \cdots = c_{29} = 0
\tag{2.13}
\]

right from the beginning leaving just 23 coefficients. The second property is the conservation
of parity, which implies that the effective Hamiltonian has to be invariant under the
transformation

\[
\{n_1, \ldots, n_r\} \to \{-n_1, \ldots, -n_r\} .
\tag{2.14}
\]

This would yield the following further relations

\[
c_3 = c_4 , \quad c_6 = c_7 , \quad c_8 = c_9 , \quad c_{12} = c_{13} , \quad c_{14} = c_{15} , \quad c_{18} = c_{19} , \quad c_{20} = c_{21} , \quad c_{22} = c_{23}
\tag{2.15}
\]

such that there are actually only 15 independent coefficients. On the other hand we cannot
assume hermiticity of (2.12) since we explicitly destroy the hermiticity by using a
non-unitary similarity transformation. But we just note that hermitian conjugation acts as

\[
\{n_1, \ldots, n_r\} \to \{n_r, \ldots, n_1\} .
\tag{2.16}
\]

Now we need to find an appropriate state to which the effective Hamiltonian should be
applied. The largest number m of coefficients that are fixed by some state of length L is
given in the following table, together with a sample chain: 

5 This is the Feynman diagrammatic structural constraint mentioned in the introduction, which entered 
the analysis of 0. 
L     m     possible initial chain
6     4     (ZZWZWW)
7     5     (ZZWZZWW)
8     10    (ZZZWZWWW)
9     14    (ZZZZWZWWW)
10    24    (ZZZZWWZWWW)
11    29    (ZZZZZWWZWWW)



Hence a single initial state with eleven (or more) elementary fields is indeed sufficient to fix
all 29 parameters of the ansatz (2.12) at once! A similar analysis for the constrained ansatz
with 15 independent coefficients shows that one would have to use a state of length ten.
We chose the initial state

\[
|\mathrm{in}\rangle = \mathrm{tr}\big(Z^\dagger Z^\dagger Z^\dagger Z^\dagger Z^\dagger W^\dagger W^\dagger Z^\dagger W^\dagger W^\dagger W^\dagger\big)\, |0\rangle
\tag{2.17}
\]

for our computation and applied both the ansatz (2.12) and the whole sequence of vertices
(B.8). The result of the latter calculation is given in (A.11) and fixes all coefficients uniquely
to

\[
\begin{aligned}
T_8 = \frac{N^4}{M^{11}} \Big[ & -22719\{\} + 33143\{0\} - 5628(\{0,1\} + \{1,0\}) - 1044\{0,2\} \\
& + 984(\{0,1,2\} + \{2,1,0\}) + 384\{0,2,1\} - 416\{1,0,2\} - 28\{1,0,2,1\} \\
& - 32\{0,3\} + 60(\{0,1,3\} + \{0,3,2\}) + 4(\{1,0,3\} + \{0,2,3\}) \\
& - 80(\{0,1,2,3\} + \{3,2,1,0\}) + 24(\{0,2,1,3\} + \{1,0,3,2\}) \\
& - 32(\{0,1,3,2\} + \{0,3,2,1\}) + 24(\{1,0,2,3\} + \{2,1,0,3\}) \Big] .
\end{aligned}
\tag{2.18}
\]

Note that the requirements (2.13) and (2.15) are satisfied. Also note that the sum of all
coefficients is zero. This implies that all states which are composed entirely of $Z^\dagger$'s or
entirely of $W^\dagger$'s or which contain only a single oscillator of the other kind have vanishing
energy shift. This accounts for the fact that these states are perturbatively protected [21].
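
The vanishing of the coefficient sum can be checked directly from the numbers quoted in (2.18). The short Python sketch below is only an illustration: the dictionary `T8` transcribes the 23 non-vanishing coefficients in units of $N^4/M^{11}$, and the helper `shift_on_uniform_state` is a hypothetical name. It uses the fact that on a state built from a single kind of oscillator every permutation acts trivially, so that each symbol $\{n_1, \ldots\}$ simply returns L times the state.

```python
# Coefficients of T_8 in units of N^4 / M^11, keyed by the permutation symbol,
# transcribed from (2.18).
T8 = {
    (): -22719, (0,): 33143,
    (0, 1): -5628, (1, 0): -5628,
    (0, 2): -1044,
    (0, 1, 2): 984, (2, 1, 0): 984,
    (0, 2, 1): 384, (1, 0, 2): -416,
    (1, 0, 2, 1): -28,
    (0, 3): -32,
    (0, 1, 3): 60, (0, 3, 2): 60,
    (1, 0, 3): 4, (0, 2, 3): 4,
    (0, 1, 2, 3): -80, (3, 2, 1, 0): -80,
    (0, 2, 1, 3): 24, (1, 0, 3, 2): 24,
    (0, 1, 3, 2): -32, (0, 3, 2, 1): -32,
    (1, 0, 2, 3): 24, (2, 1, 0, 3): 24,
}


def shift_on_uniform_state(coeffs, length):
    """Energy shift (in units of N^4 / M^11) of a state made of one kind of oscillator.

    Every symbol acts as multiplication by `length` on such a state,
    so the shift is `length` times the sum of all coefficients.
    """
    return length * sum(coeffs.values())


if __name__ == "__main__":
    print("number of non-vanishing coefficients:", len(T8))
    print("sum of coefficients:", sum(T8.values()))
    print("shift of tr (Z^dagger)^11 |0>:", shift_on_uniform_state(T8, 11))
```

The same argument covers states with a single impurity, since by cyclicity of the trace every permutation maps such a state to itself.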
We will now switch to SYM-language: By a simple rescaling

\[
D := \frac{2}{M}\, T
\tag{2.19}
\]

the effective Hamiltonian turns into the dilatation operator D and the eigenvalues are now
called conformal dimensions $\Delta$ instead of energies. The dictionary is given in the following
table:



PWMT                              SYM
effective Hamiltonian T           dilatation operator D
energy E                          conformal dimension \Delta
free energy E_0                   bare conformal dimension \Delta_0
energy shift \delta E             anomalous dimension \delta\Delta
E_0(a^\dagger) = M/2              \Delta_0(\phi) = 1 (scalars)
E_0(b^\dagger) = M                \Delta_0(F) = 2 (field strength)
E_0(c^\dagger) = 3M/4             \Delta_0(\psi) = 3/2 (fermions)
In addition to this we introduce a new notation for the effective coupling. The perturbation
expansion of D is organized in powers of 

which we now denote by 

The connection to SYM at the classical level is then given by [2]

A = G 2 N , G=2™. (2.22) 

With these definitions we find the planar four-loop dilatation operator of PWMT in- 
cluding lower orders from |5Tj to be 

\[
D(\Lambda) = \sum_{k=0}^{\infty} \Lambda^k D_{2k}
\tag{2.23}
\]

with

\[
\begin{aligned}
D_0 ={}& \{\} , \\
D_2 ={}& 2\{\} - 2\{0\} , \\
D_4 ={}& -15\{\} + 19\{0\} - 2(\{0,1\} + \{1,0\}) , \\
D_6 ={}& 187\{\} - 259\{0\} + 38(\{0,1\} + \{1,0\}) \\
& + 4\{0,2\} - 4(\{0,1,2\} + \{2,1,0\}) - 2(\{0,2,1\} - \{1,0,2\}) , \\
D_8 ={}& -\tfrac{22719}{8}\{\} + \tfrac{33143}{8}\{0\} - \tfrac{1407}{2}(\{0,1\} + \{1,0\}) - \tfrac{261}{2}\{0,2\} \\
& + 123(\{0,1,2\} + \{2,1,0\}) + 48\{0,2,1\} - 52\{1,0,2\} - \tfrac{7}{2}\{1,0,2,1\} \\
& - 4\{0,3\} + \tfrac{15}{2}(\{0,1,3\} + \{0,3,2\}) + \tfrac{1}{2}(\{1,0,3\} + \{0,2,3\}) \\
& - 10(\{0,1,2,3\} + \{3,2,1,0\}) + 3(\{0,2,1,3\} + \{1,0,3,2\}) \\
& - 4(\{0,1,3,2\} + \{0,3,2,1\}) + 3(\{1,0,2,3\} + \{2,1,0,3\}) .
\end{aligned}
\tag{2.24}
\]

2.3 Hermitization 

In App. B we have set up the perturbation theory in such a way that the effective Hamiltonian
is computed from the least number of terms. This, however, has led to a non-hermitian
effective Hamiltonian. Therefore we apply to the result (2.24) another similarity
transformation of the form

\[
D'(\Lambda) = e^{-\Lambda^3 A_3}\, e^{-\Lambda^2 A_2}\, e^{-\Lambda A_1}\, D(\Lambda)\, e^{\Lambda A_1}\, e^{\Lambda^2 A_2}\, e^{\Lambda^3 A_3} .
\tag{2.25}
\]
We demand that D' is hermitian, parity conserving, of maximal range five and made of at
most four elementary permutations. This allows the following operators 

Ai = ^{x L + x R + l){0} 

A ? = ["¥ + t( x l + XR) + \{VL + Vr) ~\{zl + zr)] {0} + x L {0, 1} + ^{1,0} 

A 3 = w{0} + y L {0, 1} + y R {l,0} + [§ - x L - x R ] {0, 2} (2.26) 

+ [-| + 2x L - \{z h + z R )] {0, 1, 2} + [-| + 2x R - \{z L + z R )] {2, 1, 0} 

+ z L {0,2,l} + z R {l,0,2} 

where $w, x_L, x_R, y_L, y_R, z_L, z_R \in \mathbb{R}$ are arbitrary. The result of this transformation is

D'_0 = {} ,

D' 2 = 2{} - 2{0} , 

D' 4 = -15{} + 19{0} - 2({0, 1} + {1,0}) , 

D' 6 = 187{} - 259{0} + 38({0, 1} + {1,0}) + 4{0, 2} - 4({0, 1, 2} + {2, 1,0}) , (2.27) 

Dg = -^{} + 33127+32a {Q} _ 1403+8a ({Q> 1} + {1> Q}) _ 259+4a {Q; 2} 

+ (122 + 2a)({0, 1, 2} + {2, 1, 0}) - (3 - 2a)({0, 2, 1} + {1, 0, 2}) - ^{1, 0, 2, 1} 

- 4{0, 3} + 4({0, 1, 3} + {0, 3, 2} + {1, 0, 3} + {0, 2, 3}) 

- 10({0, 1, 2, 3} + {3, 2, 1, 0}) - (2 + 2a)({0, 2, 1, 3} + {1,0, 3, 2}) 

- a({0, 1, 3, 2} + {0, 3, 2, 1} + {1, 0, 2, 3} + {2, 1, 0, 3}) 

where the parameter a depends only on the following particular combination 

a = \ (x L + x R f + z l -zr. (2.28) 

It does not influence the conformal dimensions and may therefore be set to any value. This 
ambiguity accounts for the freedom of making a unitary change of basis in the space of 
states. 

2.4 Short states - wrapping issues 

We still need to consider the special cases which have been excluded in the previous
computation. This concerns all states of length less than or equal to four. Out of the total number
of 13 states of this kind only the following two may acquire an anomalous dimension, cf.
the discussion below Eq. (2.18):

\[
|ZWZW\rangle = \mathrm{tr}\, Z^\dagger W^\dagger Z^\dagger W^\dagger\, |0\rangle ,
\tag{2.29}
\]
\[
|ZZWW\rangle = \mathrm{tr}\, Z^\dagger Z^\dagger W^\dagger W^\dagger\, |0\rangle .
\tag{2.30}
\]

We apply the whole perturbation expansion (B.8) to these states. The raw results are
given in (A.9). Converted to the language of the dilatation operator we find

\[
D_8 |ZWZW\rangle = -11028\, |ZWZW\rangle + 11028\, |ZZWW\rangle ,
\tag{2.31}
\]
\[
D_8 |ZZWW\rangle = +5514\, |ZWZW\rangle - 5514\, |ZZWW\rangle .
\tag{2.32}
\]
Let us now see whether this result could follow from the general expression for the effective
Hamiltonian $T_8$ by any natural prescription of how to interpret the action of five spin
interaction terms on a state of length four. One could declare the interactions to wrap
around the state, i.e. identify site 5 with site 1. This yields the mixing matrix

\[
\frac{1}{4} \begin{pmatrix} -43486 & +43486 \\ +21743 & -21743 \end{pmatrix}
\tag{2.33}
\]

in disagreement with Eq. (2.31). Alternatively one could simply drop the five spin
interaction terms, leading to

\[
\frac{1}{4} \begin{pmatrix} -43646 & +43710 \\ +21855 & -21791 \end{pmatrix}
\tag{2.34}
\]

again in disagreement with Eq. (2.31). We hence conclude that there appears to be no
"natural" extension of the long-range form of $D_8$ to short states.

This completes the computation of the PWMT-dilatation operator and we will investigate
its properties in the next section.


3 Properties of the PWMT-dilatation operator 

Having obtained the PWMT-dilatation operator, an immediate question is whether it is 
equivalent to the four-loop proposal of the SYM-dilatation operator given by Beisert, Dippel 
and Staudacher jS] or alternatively to the spin-chain Hamiltonian of Inozemtsev jlUl [7] . 
In order to make contact to these models we consider a renormalization of the coupling 
constant $\Lambda$ in order to (naively) reach a thermodynamic scaling property known to exist in
the dual string theory ("BMN scaling"). Rewritten in terms of commuting permutations we 
see, however, that the PWMT-dilatation operator is inequivalent to both of these models. 
Since the Beisert-Dippel-Staudacher Hamiltonian is the unique SU(2) Hamiltonian which 
is integrable and obeys BMN scaling up to four loops jSj (the Inozemtsev Hamiltonian 
violates BMN scaling 0), the PWMT-dilatation operator must either break integrability 
or BMN scaling or both. As an indication of integrability we display the degeneracies in 
the four-loop spectrum and present higher commuting charges. A proof of the breakdown 
of BMN scaling and deeper insights into the integrable structure of the model will be given 
in the next section by means of the underlying perturbative asymptotic Bethe ansatz. 



3.1 Renormalization and naive BMN scaling 

Let us consider states carrying two excitations $W^\dagger$ among J background fields $Z^\dagger$, i.e. a spin-chain
of length L = J + 2 with two magnons:

\[
|O_l\rangle := \mathrm{tr}\, W^\dagger (Z^\dagger)^{l-1}\, W^\dagger (Z^\dagger)^{L-1-l}\, |0\rangle = |O_{L-l}\rangle \quad \text{for } 1 \le l \le L - 1 .
\tag{3.1}
\]

The plane-wave string/gauge theory correspondence predicts a scaling dimension of these
states in the full four-dimensional N = 4 super Yang-Mills model of the all-loop form (with
the U(1)_R charge J = L - 2)



\[
\Delta^{\mathrm{BMN}}(n, J) = J + 2\sqrt{1 + \Lambda\, \frac{16\pi^2 n^2}{J^2}} \quad \text{with } n \in \mathbb{N}
\tag{3.2}
\]

in the limit $J \to \infty$. This expression displays BMN scaling behavior, i.e. in the limit
$J, N \to \infty$, with $N/J^2$ held fixed, the effective loop counting parameter $\Lambda' := \Lambda/J^2$ arises.
So let us check whether this scaling behavior is realized in PWMT by simply applying the
dilatation operator (2.27) to the set of the above two-magnon states. For the time being
we consider only those states where the magnons are further apart than the range of the
dilatation operator, i.e. the dilatation operator does not act on both magnons at the same
time. Technically speaking we restrict the action of $D'_{2k}$ to states $|O_l\rangle$ with $k < l < L - k$
and find

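$$\begin{aligned}
D'_2|O_l\rangle &= +8|O_l\rangle - 4|O_{l\pm1}\rangle ,\\
D'_4|O_l\rangle &= -52|O_l\rangle + 30|O_{l\pm1}\rangle - 4|O_{l\pm2}\rangle ,\\
D'_6|O_l\rangle &= +612|O_l\rangle - 374|O_{l\pm1}\rangle + 76|O_{l\pm2}\rangle - 8|O_{l\pm3}\rangle ,\\
D'_8|O_l\rangle &= -\tfrac{17923}{2}|O_l\rangle + \tfrac{22615}{4}|O_{l\pm1}\rangle - 1397|O_{l\pm2}\rangle + 244|O_{l\pm3}\rangle - 20|O_{l\pm4}\rangle ,
\end{aligned} \tag{3.3}$$

where we used the shorthand $|O_{l\pm m}\rangle := |O_{l-m}\rangle + |O_{l+m}\rangle$. The observation now is that
this may be written as a sum of even powers of a discretized second derivative $\partial_l^2$ which
acts as $\partial_l^2|O_l\rangle = -2|O_l\rangle + |O_{l\pm1}\rangle$. One finds

$$\begin{aligned}
D'_2 &= -4\partial_l^2 ,\\
D'_4 &= -4\partial_l^4 + 14\partial_l^2 ,\\
D'_6 &= -8\partial_l^6 + 28\partial_l^4 - 142\partial_l^2 ,\\
D'_8 &= -20\partial_l^8 + 84\partial_l^6 - 333\partial_l^4 + \tfrac{7767}{4}\partial_l^2 .
\end{aligned} \tag{3.4}$$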
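The rewriting of the hopping coefficients as powers of the lattice Laplacian can be checked mechanically. The following is a minimal sketch, assuming only that $\partial_l^2$ acts as the stencil $S + S^{-1} - 2$ with $S$ the unit shift; the function names `laplacian_power` and `stencil` are illustrative and not taken from the paper. The four-loop input uses the coefficient $7767/4$ for the $\partial_l^2$ term, which is the value that reproduces the hopping coefficients $-17923/2$ and $22615/4$.

```python
from fractions import Fraction
from math import comb

def laplacian_power(k):
    """Coefficients of S^s (s = -k..k) in (S + S^-1 - 2)^k, S being the unit shift."""
    return {s: (-1) ** (k - s) * comb(2 * k, k + s) for s in range(-k, k + 1)}

def stencil(poly):
    """Map {k: c_k}, meaning sum_k c_k (d_l^2)^k, to {m: coefficient of |O_{l+-m}>}.

    m = 0 is the coefficient of |O_l> itself.
    """
    out = {}
    for k, c in poly.items():
        for s, b in laplacian_power(k).items():
            if s >= 0:  # the stencil is symmetric under s -> -s
                out[s] = out.get(s, 0) + Fraction(c) * b
    return out

D4 = {2: -4, 1: 14}
D6 = {3: -8, 2: 28, 1: -142}
D8 = {4: -20, 3: 84, 2: -333, 1: Fraction(7767, 4)}

for name, poly in (("D'_4", D4), ("D'_6", D6), ("D'_8", D8)):
    print(name, {m: str(v) for m, v in sorted(stencil(poly).items())})
```

Exact rational arithmetic is used so that the half- and quarter-integer coefficients at four loops are compared without rounding.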

Now it is easy to determine the spectrum (neglecting the boundary effects of the action of
$D'$ involving two magnons at the same time). On the Fourier transformed basis

$$|n\rangle := \sum_{l=1}^{L-1} e^{2\pi i n (l-1)/J}\,|O_l\rangle \qquad (J = L-2) \tag{3.5}$$

the dilatation operator becomes diagonal since

$$\partial_l^2|n\rangle = \left(-2 + e^{2\pi i n/J} + e^{-2\pi i n/J}\right)|n\rangle = -4\sin^2\!\left(\frac{\pi n}{J}\right)|n\rangle \;\xrightarrow{J\to\infty}\; -\frac{4\pi^2 n^2}{J^2}\,|n\rangle . \tag{3.6}$$

On the other hand, if one expands the square root formula (3.2) in a series in $\Lambda$

$$\Delta^{\rm BMN}(n,J) = J + 2 - 4\Lambda\left(\frac{-4\pi^2n^2}{J^2}\right) - 4\Lambda^2\left(\frac{-4\pi^2n^2}{J^2}\right)^{2}
 - 8\Lambda^3\left(\frac{-4\pi^2n^2}{J^2}\right)^{3} - 20\Lambda^4\left(\frac{-4\pi^2n^2}{J^2}\right)^{4} + \ldots \tag{3.7}$$
one observes that these values are exactly produced by the highest derivatives in $D'$. The
lower order derivatives, however, spoil the BMN scaling behavior (3.2). But this can be
restored by a redefinition of the coupling constant. If we set

$$\Lambda = \Lambda_r + \tfrac{7}{2}\Lambda_r^2 - 11\Lambda_r^3 + \tfrac{1257}{16}\Lambda_r^4 + O(\Lambda_r^5) \tag{3.8}$$

then the dilatation operator expanded in $\Lambda_r$ is simply given by clean lattice Laplacians of
ascending order

$$D'_{r,2} = -4\partial_l^2 , \qquad D'_{r,4} = -4\partial_l^4 , \qquad D'_{r,6} = -8\partial_l^6 , \qquad D'_{r,8} = -20\partial_l^8 \tag{3.9}$$

and hence in accordance with BMN scaling.
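The effect of the coupling redefinition can be verified by substituting the series for $\Lambda$ into $\sum_k \Lambda^k D'_{2k}$ and collecting powers of $\Lambda_r$. The sketch below assumes the well-separated form of the operators, $D'_2 = -4\partial_l^2$, $D'_4 = -4\partial_l^4 + 14\partial_l^2$, $D'_6 = -8\partial_l^6 + 28\partial_l^4 - 142\partial_l^2$, $D'_8 = -20\partial_l^8 + 84\partial_l^6 - 333\partial_l^4 + \tfrac{7767}{4}\partial_l^2$, and the redefinition $\Lambda = \Lambda_r + \tfrac72\Lambda_r^2 - 11\Lambda_r^3 + \tfrac{1257}{16}\Lambda_r^4$; the helper names are hypothetical.

```python
from fractions import Fraction as F

ORDER = 4

# D'_{2k} as polynomials in the lattice Laplacian d = d_l^2: {power of d: coefficient}
D = {
    1: {1: F(-4)},
    2: {2: F(-4), 1: F(14)},
    3: {3: F(-8), 2: F(28), 1: F(-142)},
    4: {4: F(-20), 3: F(84), 2: F(-333), 1: F(7767, 4)},
}
# Lambda as a power series in Lambda_r: {power of Lambda_r: coefficient}
LAM = {1: F(1), 2: F(7, 2), 3: F(-11), 4: F(1257, 16)}

def series_mul(a, b):
    """Product of two truncated power series in Lambda_r."""
    out = {}
    for i, x in a.items():
        for j, y in b.items():
            if i + j <= ORDER:
                out[i + j] = out.get(i + j, 0) + x * y
    return out

def renormalized():
    """Coefficient of Lambda_r^n in sum_k Lambda^k D'_{2k}, as a polynomial in d."""
    result = {n: {} for n in range(1, ORDER + 1)}
    power = {0: F(1)}
    for k in range(1, ORDER + 1):
        power = series_mul(power, LAM)  # Lambda^k expanded in Lambda_r
        for n, c in power.items():
            for p, dcoef in D[k].items():
                result[n][p] = result[n].get(p, 0) + c * dcoef
    return {n: {p: v for p, v in poly.items() if v != 0} for n, poly in result.items()}

print(renormalized())
```

All lower derivatives cancel order by order, leaving one pure power of the Laplacian per loop order with the coefficients $-4, -4, -8, -20$.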

We wish to stress that the precise series of coefficients 4, 4, 8, 20 of the plane-wave string
formula (3.2), (3.7) is an inherent property of PWMT and cannot be altered by any
perturbative redefinition of $\Lambda$. Only the derivatives of non-highest degree could be eliminated,
and demanding their absence as a necessary prerequisite for BMN scaling uniquely fixes the
renormalization (3.8). Any differently renormalized coupling constant would prevent BMN
scaling immediately.
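The coefficients 4, 4, 8, 20 are, up to alternating signs, the Taylor coefficients of the square root. A minimal sketch, assuming the BMN formula has the form $2\sqrt{1 + 4y}$ with $y = 4\pi^2 n^2 \Lambda/J^2$ (the function name is illustrative):

```python
from fractions import Fraction

def bmn_coefficients(order):
    """Taylor coefficients c_k of 2*sqrt(1 + 4*y) = 2 + sum_k c_k y^k, k = 1..order."""
    coeffs, binom = [], Fraction(1)
    for k in range(1, order + 1):
        binom *= (Fraction(1, 2) - (k - 1)) / k  # generalized binomial(1/2, k)
        coeffs.append(2 * binom * 4 ** k)
    return coeffs

print(bmn_coefficients(4))
```

Since $\partial_l^2 \to -y/\Lambda$ on the Fourier basis, the operator $-c\,\partial_l^{2k}$ contributes $(-1)^{k+1} c\, y^k$ at order $\Lambda^k$, which matches these coefficients term by term.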

Hence, from now on we will work with the dilatation operator obtained from (2.27) by
redefining $\Lambda$ according to the naive BMN scaling prescription (3.8):

$$\begin{aligned}
D'_{r,0} &= \{\} ,\\
D'_{r,2} &= 2\{\} - 2\{0\} ,\\
D'_{r,4} &= -8\{\} + 12\{0\} - 2(\{0,1\} + \{1,0\}) ,\\
D'_{r,6} &= 60\{\} - 104\{0\} + 24(\{0,1\} + \{1,0\}) + 4\{0,2\} - 4(\{0,1,2\} + \{2,1,0\}) ,\\
D'_{r,8} &= -573\{\} + (1079 + 4a)\{0\} - (283 + 4a)(\{0,1\} + \{1,0\}) - \tfrac{175+4a}{2}\{0,2\} \\
&\quad + (80 + 2a)(\{0,1,2\} + \{2,1,0\}) - (3 - 2a)(\{0,2,1\} + \{1,0,2\}) - \tfrac{5+4a}{2}\{1,0,2,1\} \\
&\quad - 4\{0,3\} + 4(\{0,1,3\} + \{0,3,2\} + \{1,0,3\} + \{0,2,3\}) \\
&\quad - 10(\{0,1,2,3\} + \{3,2,1,0\}) + (2 + 2a)(\{0,2,1,3\} + \{1,0,3,2\}) \\
&\quad - a(\{0,1,3,2\} + \{0,3,2,1\} + \{1,0,2,3\} + \{2,1,0,3\}) .
\end{aligned} \tag{3.10}$$

One has to keep in mind that these considerations only naively confirm the BMN scaling
behavior of the renormalized dilatation operator, as the neglected boundary terms could
(and actually do) ruin this scaling. Clearly the contact interactions of two magnons will be
suppressed by a factor of $1/J$ with respect to the generic terms; however, such a suppression
does not suffice to implement BMN scaling at higher orders in perturbation theory. This
will be seen in Sec. 4.4.
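The statement that the renormalized four-loop operator acts as a clean lattice Laplacian on isolated magnons can be tested directly on the permutation structures. The sketch below assumes the convention $\{a,b,\ldots\} = \sum_i P_{i+a,i+a+1}P_{i+b,i+b+1}\cdots$ and the four-loop coefficients of (3.10), with $-\tfrac{175+4a}{2}$ multiplying $\{0,2\}$ and $-\tfrac{5+4a}{2}$ multiplying $\{1,0,2,1\}$. It tracks a single magnon on an infinite chain; since each of the two magnons of a well-separated pair contributes independently, the expected one-magnon result is half of $-20\,\partial_l^8$. The names `one_magnon_stencil` and `d8_terms` are hypothetical.

```python
from fractions import Fraction as F
from math import comb

def one_magnon_stencil(terms):
    """Action on one magnon: returns {shift: coefficient}.

    terms is a list of (coefficient, [a, b, ...]); the empty list is the identity {}.
    """
    out = {}
    for c, seq in terms:
        width = max(seq, default=-1) + 2      # number of sites the structure touches
        for r in range(width):                # magnon position inside the window
            pos = r
            for a in reversed(seq):           # rightmost permutation acts first
                if pos == a:
                    pos = a + 1
                elif pos == a + 1:
                    pos = a
            out[pos - r] = out.get(pos - r, 0) + F(c)
        # subtract the vacuum value; the chain-length pieces cancel
        # because the coefficients of all structures sum to zero
        out[0] = out.get(0, 0) - F(c) * width
    return {s: v for s, v in out.items() if v != 0}

def d8_terms(a):
    a = F(a)
    return [
        (F(-573), []),
        (1079 + 4 * a, [0]),
        (-(283 + 4 * a), [0, 1]), (-(283 + 4 * a), [1, 0]),
        (-(175 + 4 * a) / 2, [0, 2]),
        (80 + 2 * a, [0, 1, 2]), (80 + 2 * a, [2, 1, 0]),
        (-(3 - 2 * a), [0, 2, 1]), (-(3 - 2 * a), [1, 0, 2]),
        (-(5 + 4 * a) / 2, [1, 0, 2, 1]),
        (F(-4), [0, 3]),
        (F(4), [0, 1, 3]), (F(4), [0, 3, 2]), (F(4), [1, 0, 3]), (F(4), [0, 2, 3]),
        (F(-10), [0, 1, 2, 3]), (F(-10), [3, 2, 1, 0]),
        (2 + 2 * a, [0, 2, 1, 3]), (2 + 2 * a, [1, 0, 3, 2]),
        (-a, [0, 1, 3, 2]), (-a, [0, 3, 2, 1]), (-a, [1, 0, 2, 3]), (-a, [2, 1, 0, 3]),
    ]

# -10 * (S + S^-1 - 2)^4, i.e. half of -20 d_l^8
expected = {s: -10 * (-1) ** abs(s) * comb(8, 4 + s) for s in range(-4, 5)}
print(one_magnon_stencil(d8_terms(F(-2, 3))) == expected)
```

The result does not depend on the free parameter $a$, as expected for a parameter of unitary transformations; the BMN-violating effects only show up once two magnons sit inside the same window.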
3.2 Distinction to Inozemtsev and Beisert-Dippel-Staudacher spin-chains

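The comparison is most easily done in the language of commuting permutations, which was
used in [7]. Our PWMT-dilatation operator then reads

$$\begin{aligned}
D'_{r,2} &= 2L - 2\sum_i P_{i,i+1} ,\\
D'_{r,4} &= -6L + 8\sum_i P_{i,i+1} - 2\sum_i P_{i,i+2} ,\\
D'_{r,6} &= 40L - 56\sum_i P_{i,i+1} + 16\sum_i P_{i,i+2} + 4\sum_i \left(P_{i,i+2}P_{i+1,i+3} - P_{i,i+3}P_{i+1,i+2}\right) ,\\
D'_{r,8} &= -363L + 522\sum_i P_{i,i+1} - \tfrac{415}{3}\sum_i P_{i,i+2} - \tfrac{41}{3}\sum_i P_{i,i+3} + 6\sum_i P_{i,i+4} \\
&\quad - \tfrac{71}{6}\sum_i P_{i,i+1}P_{i+2,i+3} + \tfrac{217}{3}\sum_i P_{i,i+3}P_{i+1,i+2} - \tfrac{147}{2}\sum_i P_{i,i+2}P_{i+1,i+3} \\
&\quad + \tfrac{16}{3}\sum_i \big(P_{i,i+3}P_{i+2,i+4} + P_{i,i+2}P_{i+1,i+4} + P_{i,i+3}P_{i+1,i+4} \\
&\qquad\qquad - P_{i,i+4}P_{i+2,i+3} - P_{i,i+4}P_{i+1,i+2} - P_{i,i+4}P_{i+1,i+3}\big) .
\end{aligned} \tag{3.11}$$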

For the four-loop contribution we have committed ourselves to the choice $a = -2/3$ for
the free parameter of unitary transformations in order to obtain the same coefficient for all
terms in the last two lines of (3.11). Comparing with the known results for the Inozemtsev
spin-chain, it is clear that we are not dealing with that model here, as it would predict the
five spin interaction terms with different coefficients, namely

$$\begin{aligned}
D_8^{\rm Inozemtsev} &= [\ldots]\sum_i P_{i,i+4} + 4\sum_i \big(P_{i,i+3}P_{i+2,i+4} + P_{i,i+2}P_{i+1,i+4} + P_{i,i+3}P_{i+1,i+4} \\
&\qquad\qquad - P_{i,i+4}P_{i+2,i+3} - P_{i,i+4}P_{i+1,i+2} - P_{i,i+4}P_{i+1,i+3}\big) + \ldots .
\end{aligned} \tag{3.12}$$

Curiously our five spin interaction terms do agree with the proposed four-loop dilatation
operator of $\mathcal{N} = 4$ super Yang-Mills theory [11]. However, they differ in the two, three
and four spin interaction terms. The precise discrepancy to the Beisert-Dippel-Staudacher
dilatation operator $D_8^{\rm BDS}$ is

$$\begin{aligned}
D'_{r,8} - D_8^{\rm BDS} = 13\Big[ &-L + 2\sum_i P_{i,i+1} - \sum_i P_{i,i+2} + \sum_i P_{i,i+3} \\
&- \tfrac12\sum_i P_{i,i+1}P_{i+2,i+3} - \sum_i P_{i,i+3}P_{i+1,i+2} + \tfrac12\sum_i P_{i,i+2}P_{i+1,i+3}\Big] .
\end{aligned} \tag{3.13}$$

Thus it is clear that we are facing a model which is neither of the Inozemtsev nor of
the "novel" Beisert-Dippel-Staudacher type. Nevertheless, the PWMT-dilatation operator
describes a perturbatively integrable long-range spin-chain, as we will indicate in the next
subsection and confirm afterwards by deriving appropriate Bethe equations.^6

3.3 Degenerate spectrum and higher charges 

In Tab. 1 we list the (renormalized) four-loop anomalous dimensions $\delta\Delta_r$ of all su(2)-
multiplets with states of length $L \le 9$. Apart from the length, which is equal to the bare
dimension $\Delta_0$, multiplets are labeled by the magnon number $M$. Alternatively one could
use the su(2) Dynkin label $a = L - 2M$. Every multiplet, which is specified by the pair
$(L,M)$, is realized with a certain multiplicity $m$. All of these multiplets can mix among
each other. Since the dilatation operator commutes with the parity operator, it is possible
to disentangle the multiplets such that all states within one multiplet have equal parity $P$.
This has been done in Tab. 1 and is indicated by $m^P$.

The crucial observation is that whenever there are multiplets of positive and negative
parity with equal $(L, M)$-labels, then they form pairs of degenerate conformal dimension,
which have been highlighted in the table. This degeneracy is ascribed to
a conserved charge and was the original indication of integrability [4]. In fact integrability
implies the existence of a set $\{Q_i\}$ of as many conserved charges as there are degrees of
freedom:

$$[D'_r, Q_i] = 0 , \qquad [Q_i, Q_j] = 0 \qquad \forall\, i,j \in \{3,4,5,\ldots\} . \tag{3.14}$$

Here we have adopted the usual enumeration of the so-called higher charges starting from
$i = 3$, cf. Sec. 4.1. All charges are Hermitian. Charges with even (odd) index (anti-)commute
with the parity operator. Therefore the odd-indexed charges are responsible for the
degeneracy of the parity pairs.

The charges have a perturbative expansion

$$Q_i(\Lambda_r) = \sum_{k=1}^{\infty} \Lambda_r^{k-1}\, Q_{i,2k} \tag{3.15}$$

and can be written in terms of permutations (2.9). The higher the charge the wider its range:
$Q_{i,2k}$ acts simultaneously on $i + k - 1$ adjacent spins. In the appendix we give $Q_3$ and $Q_4$ up to $k = 4$.

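Table 1: Particle spectrum in su(2) subsector. Degenerate parity pairs are marked by (*).

 L   M   m^P   $\delta\Delta_r/\Lambda_r$

 2   0   1+    $0$

 3   0   1-    $0$

 4   0   1+    $0$
     2   1+    $12 - 48\Lambda_r + 336\Lambda_r^2 - \tfrac{12771}{4}\Lambda_r^3$

 5   0   1-    $0$
     2   1-    $8 - 24\Lambda_r + 136\Lambda_r^2 - 1024\Lambda_r^3$

 6   0   1+    $0$
     2   2+    $(10 - 2\sqrt5) - (34 - 10\sqrt5)\Lambda_r + \tfrac15(1170 - 414\sqrt5)\Lambda_r^2 - \tfrac15(10695 - 4134\sqrt5)\Lambda_r^3$
               $(10 + 2\sqrt5) - (34 + 10\sqrt5)\Lambda_r + \tfrac15(1170 + 414\sqrt5)\Lambda_r^2 - \tfrac15(10695 + 4134\sqrt5)\Lambda_r^3$
     3   1-    $12 - 36\Lambda_r + 252\Lambda_r^2 - 2484\Lambda_r^3$

 7   0   1-    $0$
     2   2-    $4 - 6\Lambda_r + \tfrac{37}{2}\Lambda_r^2 - \tfrac{335}{4}\Lambda_r^3$
               $12 - 42\Lambda_r + \tfrac{555}{2}\Lambda_r^2 - \tfrac{9465}{4}\Lambda_r^3$
     3   1+    $10 - 30\Lambda_r + 200\Lambda_r^2 - \tfrac{3565}{2}\Lambda_r^3$  (*)
         1-    $10 - 30\Lambda_r + 200\Lambda_r^2 - \tfrac{3565}{2}\Lambda_r^3$  (*)

 8   0   1+    $0$
     2   3+    $3.01 - 3.32\Lambda_r + 7.66\Lambda_r^2 - 27.30\Lambda_r^3$
               $9.78 - 29.22\Lambda_r + 167.64\Lambda_r^2 - 1256.15\Lambda_r^3$
               $15.21 - 59.45\Lambda_r + 456.70\Lambda_r^2 - 4430.55\Lambda_r^3$
     3   1+    $8 - 20\Lambda_r + 112\Lambda_r^2 - 842\Lambda_r^3$  (*)
         2-    $8 - 20\Lambda_r + 112\Lambda_r^2 - 842\Lambda_r^3$  (*)
               $12 - 36\Lambda_r + 264\Lambda_r^2 - 2592\Lambda_r^3$
     4   3+    $6.49 - 7.56\Lambda_r + 10.22\Lambda_r^2 + 10.25\Lambda_r^3$
               $10.90 - 31.76\Lambda_r + 249.76\Lambda_r^2 - 2538.27\Lambda_r^3$
               $22.60 - 88.68\Lambda_r + 636.03\Lambda_r^2 - 5933.98\Lambda_r^3$

 9   0   1-    $0$
     2   3-    $(8 - 4\sqrt2) - (26 - 17\sqrt2)\Lambda_r + (179 - \tfrac{993}{8}\sqrt2)\Lambda_r^2 - \tfrac{1}{32}(51376 - 36103\sqrt2)\Lambda_r^3$
               $8 - 20\Lambda_r + 98\Lambda_r^2 - 639\Lambda_r^3$
               $(8 + 4\sqrt2) - (26 + 17\sqrt2)\Lambda_r + (179 + \tfrac{993}{8}\sqrt2)\Lambda_r^2 - \tfrac{1}{32}(51376 + 36103\sqrt2)\Lambda_r^3$
     3   3+    $6.45 - 13.18\Lambda_r + 60.73\Lambda_r^2 - 378.34\Lambda_r^3$  (*)
               $11.04 - 33.50\Lambda_r + 231.93\Lambda_r^2 - 2149.91\Lambda_r^3$  (*)
               $16.51 - 55.32\Lambda_r + 383.33\Lambda_r^2 - 3494.75\Lambda_r^3$  (*)
         3-    $6.45 - 13.18\Lambda_r + 60.73\Lambda_r^2 - 378.34\Lambda_r^3$  (*)
               $11.04 - 33.50\Lambda_r + 231.93\Lambda_r^2 - 2149.91\Lambda_r^3$  (*)
               $16.51 - 55.32\Lambda_r + 383.33\Lambda_r^2 - 3494.75\Lambda_r^3$  (*)
     4   1+    $10 - 30\Lambda_r + 220\Lambda_r^2 - \tfrac{4185}{2}\Lambda_r^3$  (*)
         3-    $(12 - 4\sqrt3) - 18(2 - \sqrt3)\Lambda_r + \tfrac12(456 - 255\sqrt3)\Lambda_r^2 - \tfrac14(7908 - 4541\sqrt3)\Lambda_r^3$
               $10 - 30\Lambda_r + 220\Lambda_r^2 - \tfrac{4185}{2}\Lambda_r^3$  (*)
               $(12 + 4\sqrt3) - 18(2 + \sqrt3)\Lambda_r + \tfrac12(456 + 255\sqrt3)\Lambda_r^2 - \tfrac14(7908 + 4541\sqrt3)\Lambda_r^3$

^6 Looking through the literature, we find an old proposal for the SYM-dilatation operator in Ref. [1]
which is equivalent to the PWMT one. Formula (F.3) of [1] with the replacement $\alpha \to -1/4$ and $\beta \to a + 1$
matches exactly our above finding. In [4] integrability was implemented but the implications of BMN
scaling were not fully exploited yet. From this result, one could in principle already infer the integrability of
PWMT and the failure of BMN scaling.

Let us go back to Tab. 1 and look at the state with labels $(L, M) = (4, 2)$. It is given by
$\mathrm{tr}[Z^\dagger, W^\dagger]^2|0\rangle$ and is in fact an su(2,2|4)-descendant of the Konishi operator
$\mathrm{tr}\,a_I^\dagger a_I^\dagger|0\rangle = \mathrm{tr}[Z^\dagger \bar Z^\dagger + W^\dagger \bar W^\dagger + Y^\dagger \bar Y^\dagger]|0\rangle$.
The anomalous dimension of Konishi is known up to three-loop order. As we have done the
explicit computation (2.31), we dare to predict the four-loop value under the assumption of
a valid PWMT-SYM-relationship

$$\begin{aligned}
\delta\Delta_r &= 12\Lambda_r - 48\Lambda_r^2 + 336\Lambda_r^3 - \tfrac{12771}{4}\Lambda_r^4 \\
&= \frac{3\lambda_{\rm YM}}{4\pi^2} - \frac{3\lambda_{\rm YM}^2}{16\pi^4} + \frac{21\lambda_{\rm YM}^3}{256\pi^6} - \frac{12771\,\lambda_{\rm YM}^4}{262144\pi^8} ,
\end{aligned} \tag{3.16}$$

where we used (2.22) for the renormalized coupling. It would be extremely nice to test this
conjecture through an explicit four-loop computation in the full superconformal $\mathcal{N} = 4$
gauge field theory.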
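The change of variables in the Konishi prediction is pure arithmetic and can be reproduced as follows. This is a sketch under the assumption $\Lambda_r = \lambda_{\rm YM}/(16\pi^2)$, which is the identification implied by comparing the one-loop terms $12\Lambda_r$ and $3\lambda_{\rm YM}/(4\pi^2)$; the function name is hypothetical.

```python
from fractions import Fraction

def to_yang_mills(coeffs):
    """Rewrite sum_k c_k Lambda_r^k as sum_k d_k (lambda_YM / pi^2)^k.

    Assumes Lambda_r = lambda_YM / (16 pi^2).
    """
    return [Fraction(c) / 16 ** k for k, c in enumerate(coeffs, start=1)]

konishi = [12, -48, 336, Fraction(-12771, 4)]
print([str(d) for d in to_yang_mills(konishi)])
```

Each returned fraction is the coefficient of $(\lambda_{\rm YM}/\pi^2)^k$, so the denominators $4, 16, 256, 262144$ can be read off directly.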



4 Bethe ansatz for plane-wave matrix theory 

In this section we shall deduce the long-range Bethe ansatz allowing for an exact
diagonalization of the obtained PWMT-Hamiltonian in the planar limit. We use a technique which
was very recently developed in [22] and called "perturbative asymptotic Bethe ansatz". At
first we derive the Bethe equations in the two-magnon sector. These are straightforwardly
generalized to the full $M$-magnon sector and verified for a number of three- and four-magnon
states through numerical diagonalization. We then use the Bethe ansatz to solve the complete
two-magnon problem, which explicitly reveals terms that violate BMN scaling.



4.1 Spin-chain language 

For the following discussion we switch the interpretation of the effective PWMT-Hamiltonian
$T$, which intermediately became the PWMT-dilatation operator $D'_r$, a second time. Now,
the anomalous piece of the dilatation operator $\delta D'_r$ is regarded as the Hamiltonian $Q_2$ of
an su(2) spin-chain

$$Q_2 := \Lambda_r^{-1}\,\delta D'_r = \Lambda_r^{-1}\left(D'_r - D'_{r,0}\right) . \tag{4.1}$$

The spin-chain is, in accordance with our previous manner of speaking, still given by the
single trace su(2) states of PWMT. With the notation (4.1) the Hamiltonian joins nicely the set
of higher charges $\{Q_{i\ge3}\}$. The label $Q_1$ is reserved for the spin-chain momentum operator,
which in our case is identically zero due to the cyclicity of the trace. We denote the
eigenvalues of $Q_i$ by the corresponding small letters $q_i$. The following table lists equivalent
quantities:

 PWMT                                   SYM                                             Spin-chain

 energy shift operator $\delta T$       anomalous dilatation operator $\delta D'_r$     Hamiltonian $Q_2$
 energy shift $\delta E$                anomalous dimension $\delta\Delta_r$            energy $q_2$
 free energy $E_0$                      bare conformal dimension $\Delta_0$             spin-chain length $L$



4.2 Two-magnon scattering 

The application of the dilatation operator to a spin-chain causes motion and interaction of
the magnons. Integrability implies that only two magnons interact at a time. Therefore it
is possible to deduce the general case from studying the two-magnon situation. However,
one must not use the cyclic two-magnon states (3.1) [22], where only the distance of the
magnons matters (or alternatively their relative momentum). Once we place more magnons
onto the spin-chain the position (or momentum) relative to the new insertions becomes
important. Thus we need to investigate open two-magnon states, the building blocks for a
general state:

$$|O_{l_1,l_2}\rangle := (Z^\dagger)^{l_1-1}\, W^\dagger (Z^\dagger)^{l_2-l_1-1}\, W^\dagger (Z^\dagger)^{L-l_2}\,|0\rangle \qquad \text{for } 1 \le l_1 < l_2 \le L . \tag{4.2}$$

The dilatation operator mixes all of these $L(L-1)/2$ states. We superpose them to
eigenstates parametrized by two yet undetermined complex variables $p_1$ and $p_2$

$$|p_1,p_2\rangle := \sum_{1 \le l_1 < l_2 \le L} a(l_1,l_2,p_1,p_2)\,|O_{l_1,l_2}\rangle . \tag{4.3}$$

This may be regarded as a kind of Fourier transformation; the magnons are now labeled
by their (quasi-)momenta $p_1$ and $p_2$. The Bethe ansatz for $a(l_1,l_2,p_1,p_2)$ resembles an
in-coming and a scattered out-going wave^7

$$a(l_1,l_2,p_1,p_2) = e^{i(p_1 l_1 + p_2 l_2)}\, f(l_2 - l_1, p_1, p_2) + S(p_2,p_1)\, e^{i(p_1 l_2 + p_2 l_1)}\, f(L - l_2 + l_1, p_1, p_2) . \tag{4.4}$$

This ansatz will exactly diagonalize the spin-chain Hamiltonian

$$Q_2|p_1,p_2\rangle = q_2(p_1,p_2)\,|p_1,p_2\rangle . \tag{4.5}$$

The key ingredient of the ansatz is the S-matrix $S(p_1,p_2)$, which describes the scattering
of two magnons and solely determines the eigenvalue of $|p_1,p_2\rangle$. In the following we will
determine the S-matrix perturbatively up to order $\Lambda_r^3$

$$S(p_1,p_2) = S_0(p_1,p_2) + \Lambda_r S_1(p_1,p_2) + \Lambda_r^2 S_2(p_1,p_2) + \Lambda_r^3 S_3(p_1,p_2) + \ldots . \tag{4.6}$$

The function $f(l,p_1,p_2)$ takes the special perturbative form

$$f(l,p_1,p_2) = 1 + \Lambda_r^{l} f_0(l,p_1,p_2) + \Lambda_r^{l+1} f_1(l,p_1,p_2) + \Lambda_r^{l+2} f_2(l,p_1,p_2) + \ldots \tag{4.7}$$

and has been introduced to account for boundary effects where the magnons come close to
each other. The fact that the range of the spin-chain Hamiltonian depends on the loop order
explains why the order of the leading correction in (4.7) depends on the magnon separation
$l = l_2 - l_1$. It turns out that the precise form of the function $f(l,p_1,p_2)$ is irrelevant for the
energy eigenvalue $q_2(p_1,p_2)$.

There is another "boundary effect" where the magnons approach either end of the open
spin-chain (4.2), i.e. where $l_1 \approx 1$ or $l_2 \approx L$. Since the spin-chain Hamiltonian (4.1), (3.11)
is not well-defined for open chains we will neglect these cases and assume $l_1$ to be large
and $l_2$ to be small enough. Moreover we may replace the function $f(L - l_2 + l_1, p_1, p_2)$ in
the scattered piece of the wave function by 1, if we also assume $L$ to be large enough,
because the perturbative corrections will only contribute at order $\Lambda_r^{L-l_2+l_1}$. The reason for
introducing $f$ at this point lies in the derivation of the Bethe equations from the periodicity
conditions, cf. (4.15).

^7 Notice the reversed order of the arguments in $S$. It will turn out that $S(p_1,p_2) = [S(p_2,p_1)]^{-1}$. Also
note that the detailed form of our ansatz differs slightly from the one considered in [22].

It will turn out that despite these assumptions the wave-function and the eigenvalues 
are exactly (and correctly) determined and hence the S-matrix, the Bethe equations, and 
the energy formula as well. Since this is obviously not a rigorous derivation we shall check 
the result for a number of cases by explicit numerical diagonalization for low values of L. 
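The action of the one- and two-loop Hamiltonian on open two-magnon states, which enters the derivation that follows, can be generated mechanically from the permutation form of the operators. The sketch below assumes $Q_{2,2} = 2L - 2\sum_i P_{i,i+1}$ and $Q_{2,4} = -6L + 8\sum_i P_{i,i+1} - 2\sum_i P_{i,i+2}$ and places the magnons far from the chain ends, so that boundary effects of the open chain play no role. The function `apply` and the encoding of the operators as lists of `(coefficient, range)` pairs are illustrative choices.

```python
def apply(l1, l2, terms):
    """Act with sum_i sum_(c, m) c * (P_{i,i+m} - 1) on |O_{l1,l2}>.

    Returns {(l1', l2'): coefficient}. Writing each operator through (P - 1)
    is possible because its coefficients sum to zero.
    """
    out = {}
    for c, m in terms:
        for i in range(l1 - m, l2 + 1):  # covers every window that can touch a magnon
            moved = [i + m if x == i else i if x == i + m else x for x in (l1, l2)]
            new = tuple(sorted(moved))
            out[new] = out.get(new, 0) + c
            out[(l1, l2)] = out.get((l1, l2), 0) - c
    return {k: v for k, v in out.items() if v != 0}

Q22 = [(-2, 1)]           # 2L - 2 sum_i P_{i,i+1}
Q24 = [(8, 1), (-2, 2)]   # -6L + 8 sum_i P_{i,i+1} - 2 sum_i P_{i,i+2}

print(apply(10, 11, Q22))   # neighbouring magnons
print(apply(10, 20, Q22))   # generic separation
print(apply(10, 11, Q24))
```

For neighbouring magnons the diagonal coefficient drops from 8 to 4 at one loop, which is the contact interaction that the S-matrix has to account for.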

Now, in order to determine the unknown functions of the ansatz (4.4) from demanding
the eigenvalue equation (4.5), we apply the spin-chain Hamiltonian to the states (4.3). We
do this order by order in $\Lambda_r$. The action of the Hamiltonian on $|O_{l_1,l_2}\rangle$ is given by^8

$$Q_{2,2}|O_{l_1,l_2}\rangle = \begin{cases}
4|O_{l_1,l_2}\rangle - 2|O_{l_1-1,l_2}\rangle - 2|O_{l_1,l_2+1}\rangle & \text{for } l_2 = l_1 + 1 \\[2pt]
8|O_{l_1,l_2}\rangle - 2|O_{l_1\pm1,l_2}\rangle - 2|O_{l_1,l_2\pm1}\rangle & \text{for } l_2 \ge l_1 + 2
\end{cases} \tag{4.8}$$

$$Q_{2,4}|O_{l_1,l_2}\rangle = \begin{cases}
-8|O_{l_1,l_2}\rangle + 8|O_{l_1-1,l_2}\rangle + 8|O_{l_1,l_2+1}\rangle \\
\quad - 2|O_{l_1-1,l_2-1}\rangle - 2|O_{l_1+1,l_2+1}\rangle - 2|O_{l_1-2,l_2}\rangle - 2|O_{l_1,l_2+2}\rangle & \text{for } l_2 = l_1 + 1 \\[2pt]
-28|O_{l_1,l_2}\rangle + 8|O_{l_1\pm1,l_2}\rangle + 8|O_{l_1,l_2\pm1}\rangle \\
\quad - 2|O_{l_1-2,l_2}\rangle - 2|O_{l_1,l_2+2}\rangle & \text{for } l_2 = l_1 + 2 \\[2pt]
-24|O_{l_1,l_2}\rangle + 8|O_{l_1\pm1,l_2}\rangle + 8|O_{l_1,l_2\pm1}\rangle \\
\quad - 2|O_{l_1\pm2,l_2}\rangle - 2|O_{l_1,l_2\pm2}\rangle & \text{for } l_2 \ge l_1 + 3
\end{cases} \tag{4.9}$$

and similar more involved expressions for $Q_{2,6}$ and $Q_{2,8}$ which we refrain from stating
explicitly here. Due to the extended range of the interaction, we need to consider $k$ special
cases for $Q_{2,2k}$; for $l_2 \ge l_1 + k + 1$ the formula becomes generic. Clearly the action of
the dilatation operator is not diagonal in this basis. In the superposition (4.3), however,
we can adjust the coefficients $a(l_1,l_2,p_1,p_2)$ such that the right hand side becomes again
proportional to $|p_1,p_2\rangle$.

Observe that the action of the dilatation operator in the generic case is symmetric
in $l_1$ and $l_2$. Hence, both the in-coming and the out-going waves acquire the same factor
upon action with the dilatation operator originating from the exponentials. This factor is
independent of the function $f$ since in the generic case the separation of the magnons is so
large that only the leading 1 of the function $f$ of (4.7) contributes. It is also independent of
the S-matrix, which does not depend on the magnon positions at all. Therefore the generic
equation determines the eigenvalue $q_2(p_1,p_2)$ alone. The eigenvalue splits into a sum

$$q_2(p_1,p_2) = q_2(p_1) + q_2(p_2) \tag{4.10}$$

with

$$q_2(p) = 8\sin^2(\tfrac{p}{2}) - 32\Lambda_r\sin^4(\tfrac{p}{2}) + 256\Lambda_r^2\sin^6(\tfrac{p}{2}) - 2560\Lambda_r^3\sin^8(\tfrac{p}{2}) + \ldots . \tag{4.11}$$

^8 We use similar shorthands as before, e.g. $|O_{l_1\pm m,l_2}\rangle := |O_{l_1-m,l_2}\rangle + |O_{l_1+m,l_2}\rangle$.

This function is called "dispersion relation" or one-magnon energy. The boundary
equations determine particular perturbative orders of $S$ and particular values of the $f_i$'s. We
summarize this in the following table:

                        $D_2$                          $D_4$                          $D_6$                          $D_8$

 $l_2 \ge l_1 + 5$      $q_2(p)|_{O(\Lambda_r^0)}$     $q_2(p)|_{O(\Lambda_r^1)}$     $q_2(p)|_{O(\Lambda_r^2)}$     $q_2(p)|_{O(\Lambda_r^3)}$
 $l_2 = l_1 + 4$                                                                                                     $f_0(3,p_1,p_2)$
 $l_2 = l_1 + 3$                                                                      $f_0(2,p_1,p_2)$               $f_1(2,p_1,p_2)$
 $l_2 = l_1 + 2$                                       $f_0(1,p_1,p_2)$               $f_1(1,p_1,p_2)$               $f_2(1,p_1,p_2)$
 $l_2 = l_1 + 1$        $S_0(p_1,p_2)$                 $S_1(p_1,p_2)$                 $S_2(p_1,p_2)$                 $S_3(p_1,p_2)$

The form of $f$ is not very enlightening as it depends on the basis (expressed by the parameter
$a$ of $D_8$). The S-matrix is of course independent of $a$ and we find



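$$S(p_1,p_2) = \frac{\varphi(p_1) - \varphi(p_2) + i}{\varphi(p_1) - \varphi(p_2) - i}\,\exp\big(i\psi(p_1,p_2)\big) \tag{4.12}$$

with the phase function

$$\varphi(p) = \tfrac12\cot(\tfrac{p}{2})\left[1 + 8\Lambda_r\sin^2(\tfrac{p}{2}) - 32\Lambda_r^2\sin^4(\tfrac{p}{2}) + 256\Lambda_r^3\sin^6(\tfrac{p}{2}) + \ldots\right] \tag{4.13}$$

and the exponent

$$\psi(p_1,p_2) = 104\Lambda_r^3\left[\sin^2(\tfrac{p_1}{2})\,\sin p_2\,\sin^2(\tfrac{p_2}{2}) - (p_1 \leftrightarrow p_2)\right] . \tag{4.14}$$

These formulas provide an eigenstate $|p_1,p_2\rangle$ with eigenvalue $q_2(p_1,p_2)$ for any value of $p_1$
and $p_2$. The discrete physical spectrum is only specified after imposing boundary conditions,
which lead to the Bethe equations and pick out a discrete set of Bethe momenta.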
In the two-magnon case we impose the periodicity condition

$$a(l_1,l_2,p_1,p_2) = a(l_2,l_1 + L,p_1,p_2) . \tag{4.15}$$

Now it pays off that we introduced the same function $f$ for the in-coming and out-going wave,
since this leads, independently of $f$, to

$$\exp(iLp_1) = S(p_1,p_2) = \exp(-iLp_2) . \tag{4.16}$$

These are the two-magnon Bethe equations. Due to the originating trace structure the
states of the spin-chain need to be invariant under cyclic permutations. This leads to the
additional zero total momentum condition

$$p_1 + p_2 = 0 . \tag{4.17}$$

The equations (4.16) and (4.17) completely determine the two-magnon spectrum, cf. Sec. 4.4.
Finally let us remark that the energy shift of the four-loop "wrapping" state of Sec. 2.4
may also be brought into the apparently universal form (4.12) with the identical phase
function (4.13) and the exponential factor $\psi_{\rm wrap}(p,-p) = -\tfrac{994}{3}\Lambda_r^3\sin^4(\tfrac{p}{2})\sin p$, which only
differs from (4.14) in the overall constant up front.^9

^9 We thank S. Frolov for a discussion on this point.
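The ingredients of the two-magnon S-matrix are simple enough to evaluate numerically. The following sketch assumes the form $S = \frac{\varphi(p_1)-\varphi(p_2)+i}{\varphi(p_1)-\varphi(p_2)-i}\,e^{i\psi(p_1,p_2)}$ with the phase function truncated after the $\Lambda_r^3$ term, where the $\Lambda_r^2$ term is taken with a minus sign, $-32\Lambda_r^2\sin^4(p/2)$. It checks two properties used in the text: the inversion relation $S(p_1,p_2)\,S(p_2,p_1) = 1$ and the one-loop limit $S(p,-p) = e^{ip}$. Function names are illustrative.

```python
import cmath
import math

def q2(p, lam):
    """One-magnon energy (dispersion relation), truncated at order lam^3."""
    s2 = math.sin(p / 2) ** 2
    return 8 * s2 - 32 * lam * s2**2 + 256 * lam**2 * s2**3 - 2560 * lam**3 * s2**4

def phi(p, lam):
    """Phase function, truncated at order lam^3."""
    s2 = math.sin(p / 2) ** 2
    series = 1 + 8 * lam * s2 - 32 * lam**2 * s2**2 + 256 * lam**3 * s2**3
    return 0.5 / math.tan(p / 2) * series

def psi(p1, p2, lam):
    """Exponent of the S-matrix; antisymmetric under p1 <-> p2."""
    g = lambda a, b: math.sin(a / 2) ** 2 * math.sin(b) * math.sin(b / 2) ** 2
    return 104 * lam**3 * (g(p1, p2) - g(p2, p1))

def s_matrix(p1, p2, lam):
    d = phi(p1, lam) - phi(p2, lam)
    return (d + 1j) / (d - 1j) * cmath.exp(1j * psi(p1, p2, lam))

p1, p2, lam = 0.7, -1.3, 0.01
print(abs(s_matrix(p1, p2, lam) * s_matrix(p2, p1, lam) - 1))
print(abs(s_matrix(0.9, -0.9, 0.0) - cmath.exp(0.9j)))
```

Both printed numbers should vanish up to rounding. The inversion relation holds for any odd exponent $\psi$, so it tests the structure of the ansatz rather than the value 104.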






4.3 Bethe equations 

The case of a spin-chain with $M$ magnons and total length $L$ is a straightforward
generalization of (4.16). According to the notion of [22], in an integrable system a general
scattering process is always factorized into two-body scattering sub-processes. Therefore
one just needs to replace the right hand side of (4.16) by a product of S-matrices. Any
magnon, represented by $p_k$, interacts once with every other magnon $p_{j\neq k}$ on its way around
the spin-chain:

$$\exp(iLp_k) = \prod_{j\neq k} S(p_k,p_j) = \prod_{j\neq k} \frac{\varphi(p_k) - \varphi(p_j) + i}{\varphi(p_k) - \varphi(p_j) - i}\,\exp\big(i\psi(p_k,p_j)\big) \tag{4.18}$$

for $k = 1, \ldots, M$. This is supplemented by the zero total momentum condition

$$\sum_{j=1}^{M} p_j = 0 . \tag{4.19}$$

As the derivation of (4.18) contained some subtle points we have checked the Bethe equation
for a number of states. The results are documented in App. D.

Let us now compare this form to the other known perturbative long-range Bethe ansätze
in the literature. We note that the dispersion relation (4.11) and the phase function (4.13)
are exactly the same as for the Beisert-Dippel-Staudacher spin-chain [8]. The difference
lies in the presence of the exponential factor in the Bethe ansatz for the PWMT. This
exponential is reminiscent of the Bethe ansatz for quantum strings proposed by Arutyunov,
Frolov and Staudacher. There the exponent is a series of antisymmetric products of all
conserved charges. This is exactly the same as we observe here, since our exponent is in
fact

$$\psi(p_1,p_2) = \frac{13\Lambda_r^3}{8}\left[q_2(p_1)\,q_3(p_2) - q_3(p_1)\,q_2(p_2)\right] \tag{4.20}$$

where

$$q_2(p) = 8\sin^2(\tfrac{p}{2}) , \qquad q_3(p) = 8\sin(p)\sin^2(\tfrac{p}{2}) \tag{4.21}$$

are the lowest order contributions to the eigenvalues of the higher charges $Q_2$ and $Q_3$,
cf. (4.11) and (D.2). A more detailed comparison to the quantum string ansatz reveals that
our exponent has one additional power of the coupling constant $\Lambda_r$. This explains why the
exponential yields a breakdown of BMN scaling in our case, while it results in a sub-leading
discrepancy in the near-BMN limit with respect to SYM theory in the quantum string case.
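That the exponent is an antisymmetric product of the two lowest charges is a short trigonometric identity, which the following sketch confirms numerically. It assumes the prefactor $13\Lambda_r^3/8$ for the charge form and $104\Lambda_r^3$ for the explicit form, related by $\tfrac{13}{8}\cdot 8 \cdot 8 = 104$; the function names are hypothetical.

```python
import math

def q2_lowest(p):
    return 8 * math.sin(p / 2) ** 2

def q3_lowest(p):
    return 8 * math.sin(p) * math.sin(p / 2) ** 2

def psi_charges(p1, p2, lam):
    """Exponent written through the lowest-order charge eigenvalues."""
    return 13 * lam**3 / 8 * (q2_lowest(p1) * q3_lowest(p2) - q3_lowest(p1) * q2_lowest(p2))

def psi_explicit(p1, p2, lam):
    """Exponent written out in trigonometric functions."""
    g = lambda a, b: math.sin(a / 2) ** 2 * math.sin(b) * math.sin(b / 2) ** 2
    return 104 * lam**3 * (g(p1, p2) - g(p2, p1))

for p1, p2 in [(0.3, 1.1), (2.0, -0.4), (1.5, -1.5)]:
    print(psi_charges(p1, p2, 0.2), psi_explicit(p1, p2, 0.2))
```

For opposite momenta the exponent reduces to $\psi(p,-p) = -208\Lambda_r^3\sin^4(\tfrac p2)\sin p$, which is the combination entering the two-magnon Bethe equation.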

The existence of an exponential factor in the Bethe ansatz seems to be the generic form 
for integrable long-range spin-chains. It would be very interesting to determine whether the 
Bethe ansatz for the Inozemtsev spin-chain may be brought into this general form as well. 
The idea would be to trade contributions in the phase function for an exponential factor. 
4.4 Two-magnon spectrum and violation of BMN scaling

Using the above Bethe ansatz one may solve the two-magnon problem explicitly. Here, due
to the zero total momentum condition $p_1 = -p_2 =: p$, one finds just one Bethe equation

$$\exp(iLp) = \frac{\varphi(p) + i/2}{\varphi(p) - i/2}\,\exp\big(i\psi(p,-p)\big) . \tag{4.22}$$

This equation is solved via the perturbative ansatz

$$p = p_0 + \Lambda_r p_1 + \Lambda_r^2 p_2 + \Lambda_r^3 p_3 + \ldots ; \tag{4.23}$$

plugging this into (4.22) and making use of (4.13) and (4.14) one finds the explicit result:

$$\begin{aligned}
p_0 &= \frac{2\pi n}{L-1} ,\\
p_1 &= -\frac{16\cos(\frac{\pi n}{L-1})\sin^3(\frac{\pi n}{L-1})}{L-1} ,\\
p_2 &= \frac{64\cos(\frac{\pi n}{L-1})\sin^5(\frac{\pi n}{L-1})}{(L-1)^2}\left[2L + (L+3)\cos(\tfrac{2\pi n}{L-1})\right] ,\\
p_3 &= \frac{32\cos(\frac{\pi n}{L-1})\sin^5(\frac{\pi n}{L-1})}{3(L-1)^3}\Big[(-79L^2 + 122L - 79) + 4(5L^2 - 31L + 20)\cos(\tfrac{2\pi n}{L-1}) \\
&\qquad\qquad + 4(4L^2 + 13L - 20)\cos(\tfrac{4\pi n}{L-1}) + 4(L^2 + 7L + 10)\cos(\tfrac{6\pi n}{L-1})\Big] .
\end{aligned} \tag{4.24}$$



Inserting these expressions into (4.11) yields, upon expansion in $\Lambda_r$, the closed
expression for the two-magnon spectrum $\delta\Delta_r = \Lambda_r q_2$ parametrized by the integers $n$ and $L$, where
$L$ denotes the length of the spin-chain and $n = 0, \ldots, [\tfrac{L-2}{2}]$:

$$\begin{aligned}
\delta\Delta_r(n,L) &= 16\Lambda_r\sin^2(\tfrac{\pi n}{L-1}) \\
&\quad - \frac{64\Lambda_r^2}{L-1}\sin^4(\tfrac{\pi n}{L-1})\left[(L+1) + 2\cos(\tfrac{2\pi n}{L-1})\right] \\
&\quad + \frac{256\Lambda_r^3}{(L-1)^2}\sin^6(\tfrac{\pi n}{L-1})\left[(2L^2 + 5L + 2) + 2(5L+2)\cos(\tfrac{2\pi n}{L-1}) + (L+4)\cos(\tfrac{4\pi n}{L-1})\right] \\
&\quad + \frac{256\Lambda_r^4}{3(L-1)^3}\sin^6(\tfrac{\pi n}{L-1})\Big[(-30L^3 - 39L^2 + 51L - 39) \\
&\qquad + 3(10L^3 - 49L^2 + 26L - 13)\cos(\tfrac{2\pi n}{L-1}) + 8(11L^2 - 10L)\cos(\tfrac{4\pi n}{L-1}) \\
&\qquad + (18L^2 + 90L - 30)\cos(\tfrac{6\pi n}{L-1}) + (2L^2 + 17L + 30)\cos(\tfrac{8\pi n}{L-1})\Big] .
\end{aligned} \tag{4.25}$$

Expanding this in the BMN limit, i.e. $\Lambda_r, L \to \infty$ with $\Lambda_r/L^2$ fixed, one finds

$$\lim_{\Lambda_r \propto L^2} \delta\Delta_r(n,L) = 4\left(4\pi^2n^2\frac{\Lambda_r}{L^2}\right) - 4\left(4\pi^2n^2\frac{\Lambda_r}{L^2}\right)^{2} + 8\left(4\pi^2n^2\frac{\Lambda_r}{L^2}\right)^{3}
 - 6656\,\pi^6n^6\,\frac{\Lambda_r^4}{L^7} - 20\left(4\pi^2n^2\frac{\Lambda_r}{L^2}\right)^{4} + \ldots . \tag{4.26}$$

This is in fact divergent due to the four-loop term $\sim \Lambda_r^4/L^7$: the proclaimed breakdown
of BMN scaling. This formula is to be compared with the proper scaling behavior (3.7)
(with $L \sim J$). The additional (violating) piece originates from the microscopic contact
interactions of two magnons.
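The closed two-magnon formula can be evaluated for small chains and compared with the tabulated spectrum. The sketch below is a direct transcription of the four terms of (4.25), with the constant of the three-loop bracket taken as $2L^2 + 5L + 2$, which is the value that reproduces the tabulated three-loop coefficients; the function name is illustrative. For $L = 5$ and $L = 7$ it returns the entries of Table 1, while for $L = 4$ the last coefficient comes out as $-3054$, the asymptotic Bethe ansatz value rather than the true $-12771/4$ of the wrapping state.

```python
import math

def two_magnon_coefficients(n, L):
    """Coefficients of Lambda_r^1 .. Lambda_r^4 in the two-magnon anomalous dimension."""
    t = math.pi * n / (L - 1)
    s2 = math.sin(t) ** 2
    c = lambda k: math.cos(2 * k * t)
    one = 16 * s2
    two = -64 / (L - 1) * s2**2 * ((L + 1) + 2 * c(1))
    three = 256 / (L - 1) ** 2 * s2**3 * (
        (2 * L**2 + 5 * L + 2) + 2 * (5 * L + 2) * c(1) + (L + 4) * c(2)
    )
    four = 256 / (3 * (L - 1) ** 3) * s2**3 * (
        (-30 * L**3 - 39 * L**2 + 51 * L - 39)
        + 3 * (10 * L**3 - 49 * L**2 + 26 * L - 13) * c(1)
        + 8 * (11 * L**2 - 10 * L) * c(2)
        + (18 * L**2 + 90 * L - 30) * c(3)
        + (2 * L**2 + 17 * L + 30) * c(4)
    )
    return [one, two, three, four]

for n, L in [(1, 5), (1, 7), (2, 7), (1, 4)]:
    print((n, L), [round(x, 6) for x in two_magnon_coefficients(n, L)])

# leading large-L part of the four-loop bracket is -78 L^2, giving the BMN-violating term
print(78 * 256 // 3)
```

The last line reproduces the coefficient 6656 of the $\Lambda_r^4/L^7$ term: the $L^3$ parts of the bracket cancel at small momentum, and the surviving $-78L^2$ is multiplied by $256\,\pi^6 n^6/(3L^9)$.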

5 Conclusion and discussion 

In this paper we have performed a fourth order perturbative analysis of the SU(N) plane-
wave matrix theory in its planar large $N$ limit in the closed minimal su(2) subsector.
The outcome of our analysis yielded an effective planar Hamiltonian, which is integrable
to this rather high loop order. Additionally we have explicitly constructed the first two
higher commuting charges of the integrable system to the corresponding loop order, which
is summarized in the appendix. We take these results as a very strong indication of the complete
integrability of the planar matrix theory system and similarly so for the closely related
"mother" theory of superconformal $\mathcal{N} = 4$ Yang-Mills.

Moreover we determined the underlying perturbative asymptotic Bethe ansatz [22] of
the system for states of length larger than four. While the form of the integrable spin-chain
Hamiltonian (or planar dilatation operator or effective matrix model Hamiltonian) becomes
more and more involved at higher loops, the "core" of the model, the two-magnon S-matrix
$S(p_1,p_2)$, is of the intriguingly compact form

$$S(p_1,p_2) = \frac{\varphi(p_1) - \varphi(p_2) + i}{\varphi(p_1) - \varphi(p_2) - i} \times \exp\Big(i f(\Lambda_r)\sum_i q_i(p_1)\,q_{i+1}(p_2) - (p_1 \leftrightarrow p_2)\Big) , \tag{5.1}$$

which was first established for the quantum $AdS_5 \times S^5$ string Bethe ansatz. Here $q_i(p)$
denotes the higher charges of the spin-chain, $f(\Lambda_r)$ a function of the coupling constant and
$\varphi(p)$ the phase relation of the model. This form of the S-matrix appears to be generic for
integrable long-range spin-chains: Except for the Inozemtsev chain all known long-range spin-
chains or Bethe ansätze (the Beisert-Dippel-Staudacher chain [8], the Arutyunov-Frolov-
Staudacher quantum string chain, our plane-wave matrix theory effective Hamiltonian
and the recently established S-matrices of the sl(2) and su(1|1) $\mathcal{N} = 4$ dilatation operators
[22]) fall into the class (5.1) with the same functions $\varphi(p)$ and $q_i(p)$!^10 It would be very
interesting to establish whether the Inozemtsev chain S-matrix may also be brought into
the form of Eq. (5.1).

Our perturbative computation was only attainable through the use of a tailor-made
computer program which generated all planar graphs and performed the necessary term
algebra in a massively distributed fashion, which is reviewed in detail in App. A. The
program should also be applicable for similar problems in planar perturbation theory at
high orders, in particular for a similar analysis in larger closed subsectors of the plane-wave
matrix model, such as su(2|3).

^10 Strictly speaking this is not quite correct as the fermionic su(1|1) chain has a trivial phase relation
$\varphi(p) = 0$.
The motivation for this computation grew out of the close relationship of the effective
plane-wave matrix model Hamiltonian to the dilatation operator of $\mathcal{N} = 4$ super Yang-
Mills, which is firm in the joint minimal su(2) subsector up to the three-loop order [21].
This connection stems from the consistent (classical) reduction of the full field theory on
a three-sphere [5] and is seemingly stable under quantum corrections (up to three loops),
resulting in a "renormalization" of the relation between the matrix model (dimensionless)
mass parameter $M$ and the super Yang-Mills coupling constant $g_{\rm YM}$

$$\Lambda = \Lambda_r + \tfrac{7}{2}\Lambda_r^2 - 11\Lambda_r^3 + \tfrac{1257}{16}\Lambda_r^4 + O(\Lambda_r^5) \tag{5.2}$$

where $\Lambda = 2N/M^3$ and $\Lambda_r = g_{\rm YM}^2 N/(4\pi)^2$. Here the highest order term was determined
by requiring "naive" BMN scaling, i.e. demanding that the four-loop contribution to the
effective Hamiltonian acts as a clean lattice Laplacian to the fourth power on well separated
magnons. It remains to be shown whether the established correspondence of plane-wave
matrix theory to the $\mathcal{N} = 4$ dilatation operator of the full uncompactified field theory extends
beyond the three-loop level (in the planar sector). For this the four-loop anomalous scaling
dimensions of one or two super Yang-Mills operators would need to be computed. Hence,
unfortunately, our verdict on the perturbative BMN scaling of the full $\mathcal{N} = 4$ gauge theory
at four loops cannot be conclusive; it does, however, certainly question its existence beyond
three loops. A better understanding of the inner workings of the planar plane-wave matrix
theory/$\mathcal{N} = 4$ super Yang-Mills correspondence on the basis of an effective field theory
approach, in which one integrates out the higher Kaluza-Klein modes of the 4d gauge theory,
would be of great importance.

Our results also shed light on the structure of the "wrapping" interactions for long-range
spin-chains, which are inaccessible through the means of the asymptotic Bethe ansatz where
the length of the chain needs to be strictly larger than the spread of the local Hamiltonian.
In the considered su(2) subsector this happens for the first time at four-loop order, where the
resulting spin-chain Hamiltonian involves five neighboring spin interaction terms, whereas
the shortest non-protected operator in this subsector is a Konishi descendant of length
four: $\mathrm{tr}[Z^\dagger, W^\dagger]^2$. Its four-loop energy shift can neither be accounted for by any "natural"
prescription of how to act with the five spin terms of the Hamiltonian on a length four state
nor by blindly applying the Bethe ansatz. One finds

$$\delta\Delta_r = 12\Lambda_r - 48\Lambda_r^2 + 336\Lambda_r^3 - \begin{cases}
2958\,\Lambda_r^4 & \text{(wrap around)} \\
3026\,\Lambda_r^4 & \text{(discard long-range terms)} \\
3054\,\Lambda_r^4 & \text{(Bethe ansatz)} \\
\tfrac{12771}{4}\,\Lambda_r^4 & \text{(true value)}
\end{cases}$$

More work is needed to understand how this intrinsic restriction of the asymptotic Bethe
ansatz could be overcome.

Finally it would be very nice to establish a proof of the integrability of the planar
plane-wave matrix theory. Compared to the situation for the $\mathcal{N} = 4$ super Yang-Mills field
theory this appears to be more tractable, as one has access to the full non-perturbative
Hamiltonian. However, efficient techniques to isolate the planar large $N$ sector in a non-
perturbative fashion are lacking.
A Details on the Computation

The problem solved by the gemstone code (which was developed for this application) is the
generation of planar multiloop graphs from large sets of vertices with directed and colored
legs, which have a term algebra associated to them, with a highly efficient algorithm that
furthermore is implemented close to the machine language level for maximum speed.

Considering the technical complexity of the calculation of the dilatation operator at four
loops, the authors have strong reason to believe that this may well be the largest symbolic
calculation performed so far.^11

The key to its successful completion was the development of aggressively optimized
symbolic algebra code. With this program, which the authors named gemstone^12, the three-loop
calculations that had been done earlier (by using the already highly optimized FORM [26]
symbolic manipulation program and a more conventional algorithm) could be checked in
about 1/100 of the original calculation time.

As this code is at present the best tool available to systematically do many more planar
higher-loop matrix model calculations, and due to its modularity should be easily adaptable
to planar high-loop field theory calculations as well, it might be quite interesting in itself
and has been included in the source archive of the arXiv.org preprint of this work at
http://www.arxiv.org/e-print/hep-th/0412331

Furthermore, as the results presented in this work hinge on the correctness of the
algorithm as well as its implementation, the scientific demand of verifiability gives another
strong incentive to make this code publicly available.

Finally, many of the approaches to the problem of making the calculation fast are generic
enough to be useful in many other situations, even if not of direct relevance to the physics
of this problem. As this appendix may be interesting to computer scientists as well, it is
essentially self-contained.
A.1 The Task 

In a nutshell, the problem at hand consists of the determination of perturbative quantum 
corrections to the effective Hamiltonian that come from interaction diagrams which can 
be drawn without self-intersections (i.e. are planar). Conceptually, it may be obtained by 
a simple algebraic normal ordering procedure with subsequent isolation of those contribu- 
tions that can be associated to planar graphs. While this approach is very simple, it is 
quite limited in practice by excessive growth of the number of terms when increasing the 
perturbative order. As both quantitatively and qualitatively new phenomena appear when 
going to higher orders, doing such calculations is not merely an academic exercise. Hence, 
one should make use of as much of the specific structure of the problem as possible in order 
to reduce the complexity of high-order calculations. In our case, this especially means to 
make as much use of the planarity property as possible right from the beginning, instead of 
generating all terms and later throwing away contributions that can be identified as coming 
from nonplanar graphs. While the number of planar graphs also increases dramatically 
with increasing number of vertices, there are nevertheless far fewer planar than nonplanar 
graphs in all problems of interest. 

11 There are far larger massively distributed prime searches as well as encryption breaking attempts, but 
the underlying questions should perhaps not be regarded as being of symbolic nature. 

12 As compactness of code and flawlessness are the most important virtues for such a task. 

We have to find all graphs with certain pre-defined properties, associate a term to every 
graph, and sum all the contributions from allowed graphs. Graphs are built from vertices, 
which may have up to four legs, by successively adding vertices and fusing legs. Every vertex 
leg has a direction (in-going or out-going, corresponding to particle creation or annihilation) 
as well as a color (corresponding to particle type). Only matching legs (same color, different 
directions) may be fused. To every vertex, we associate an algebraic factor depending on 
the type of the vertex. Leg fusion induces algebraic transformations. 
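
The matching rule for legs can be sketched in a few lines of Python. This is only an illustration: the names `Leg` and `can_fuse` and the sample color labels are assumptions made here and are not taken from the gemstone sources.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Leg:
    outgoing: bool  # True: particle creation, False: particle annihilation
    color: str      # particle type; the labels used below are hypothetical


def can_fuse(a: Leg, b: Leg) -> bool:
    """Two legs match if they have the same color and different directions."""
    return a.color == b.color and a.outgoing != b.outgoing
```

For instance, `can_fuse(Leg(True, "Z"), Leg(False, "Z"))` holds, while two out-going legs or two legs of different color are rejected.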

Vertices have three or four legs, except for some degenerate cases where self-binding of 
a four-leg vertex gives a two-leg vertex. The factors associated to three-leg vertices contain 
a single power of the interaction strength g, while those associated to (possibly degenerate) 
four-leg vertices contain a factor $g^2$. These two large vertex classes are further subdivided 
by considering the energy shift associated to particular vertices - this is just the mass of the 
particles created (represented by out-going legs) minus the mass of the particles destroyed, 
and can be represented by a small integer. With this additional distinction, there are 15 
different vertex classes, each containing between 1 and 188 different types of vertices. 

In a first step, we expand the Hamiltonian into a set of sequences of vertex classes. As 
this is of minor relevance for the heart of the calculation, this step is discussed separately 
in section A.4. For the fourth-order calculation, this yields 338 834 individual sequences, 
each carrying an interaction strength factor $g^8$. 

Next, an initial closed chain of vertices with all legs on the outside has to be chosen. 
Starting from this chain, we have to consider every single sequence of vertex classes individ- 
ually, finding all possible realizations of that sequence as a sequence of vertex types. These 
building blocks then have to be fused subsequently to the initial chain in all possible ways, 
subject to some extra constraints. The fundamental graph operations, such as fusing legs, 
induce transformations on the associated terms that have to be kept track of. Eventually, 
all terms from admissible final graphs have to be summed. 

As an example, the vertex class sequence consisting of four times the largest class with 
188 four-leg vertex types in it alone gives rise to $188^4$ = 1 249 198 336 individual sequences 
of vertex types. For every single one of those, all possible ways to add these vertices in 
sequence to the starting graph have to be taken into consideration. Likewise for the other 
338 833 sequences. This is depicted schematically in figure 1. 

For the determination of the four-loop dilatation operator, it suffices to form all additions 
of all these sequences of vertices to a suitably chosen initial chain of length eleven. 

Vertices to be added may be turned, but not flipped over. Every final graph must obey 
the following constraints: 

1. It is planar. 

2. All the open legs are on the outside. 

3. Its construction by successive addition of vertices never involved a step where an 
in-going leg remained open (as this would act on the vacuum and give zero). 
Figure 1: Forming all Graphs: One particular choice where the last two vertices to be added are of 
the same type. 

As an extra difficulty, some legs are fermionic, which means that care has to be taken of 
extra factors of -1 which occur in some (but not all) situations when such legs are fused. 

A.2 Choice of the Platform 

As a basis for the implementation, the Objective CaML [30] system has been chosen (an 
early prototype was written in LISP). This decision was based on the expectation that 
the four-loop calculation might become almost prohibitively large, and hence one should 
be able to make use of computer power donated by volunteers to run the calculation in 
a highly parallelized scheme, similar to efforts like SETI@home [31], GIMPS and many 
others. This requires the availability of a high-quality native code compiler for a reasonable 
price (ideally for free) for all intended target platforms, especially including Linux/x86, Mi- 
crosoft Windows, Mac OS X, and possibly even Linux on non-PC hardware. This excludes 
LISP as an implementation language, since there is no free LISP Compiler of sufficient 
strength available for the Windows platform. Java as well as C (or C++) are not considered 
viable options either, as some of the algorithmic tricks that had to be used are 
prohibitively clumsy to express in such languages that do not provide sufficient support 
especially for lambda abstraction; furthermore, code development and in particular debug- 
ging is facilitated greatly by the availability of an interactive command prompt where one 
can immediately test program components individually. This is also not widely available 
for the well-known mainstream languages. 

The decision to use Objective CaML was further supported by OCaml's excellent record 
for being a versatile tool to quickly build solutions for challenging problems [?], the 
availability of key algorithmic libraries (hashes, rational numbers, etc.), as well as quite positive 
experience with this system in an earlier research project from the side of one of the authors. 

A.3 Planar Graph Generation 

As explained, we are interested in planar graphs only, as this restriction will considerably 
constrain the number of graphs. 

In terms of data structures, one attractive way to organize this calculation is to represent 
every graph as a vector of connected components, where every connected component is a 
vector of domains (cells) that carry legs along their boundaries and are glued together in 
such a way that all legs point to the inside. To every connected component, we associate a 
particular term - details of the term algebra will be discussed in the next section. 

The cells may themselves be represented as vectors of legs, and every leg has to know 
about its type (direction and color) and furthermore some kind of pointer (in a general, 
not implementation-specific sense) into the term of the leg's connected component. In the 
implementation, all this information is recorded in a single signed 30-bit integer per leg. 
We make use of the observation that for a four-loop calculation, we will introduce at most 
8 · 3 = 24 legs from vertices in addition to the 11 legs in the initial sequence. This means 
that we can conveniently label legs by 6-bit numbers in the range 0 ... 35. (Additional legs 
may be introduced by tracing fermionic loops of odd length - this gives an extra factor $\epsilon_{ijk}$ 
from the projector. There may be at most two such factors in our calculations, so we take 
labels for these indices from the upper end of the 6-bit integer range.) 

Figure 2: Fusing legs 

Fundamental operations on graphs are (1) the addition of a new vertex, which initially 
is regarded as an individual connected component with only one cell, (2) the composition of 
two connected components right before legs between them are fused: this causes one entry 
to be removed from the vector of all connected components and another one to be enlarged, 
as well as corresponding transformation on the vector of associated terms, where one entry 
is removed and multiplied into another one, and (3) connection of cells by fusion of legs. 
This reduces the number of cells in a connected component and as well induces certain 
transformations on the associated term. 

Concerning the fusion of legs, it is helpful to think of each individual connected 
component as living on a sphere. When legs are fused, these spheres are punctured at the interior 
of the partaking cells and then connected appropriately, see figure 2. 

Note that steps (1) and (3) introduce a choice among many different possibilities. In 
(1) this comes from (in some cases, quite many) different vertex types contributing to a 
vertex class, all of them having individual leg structure, hence requiring graph generation 
to branch; in (3) this comes from the possibility to connect the legs from two domains 
in multiple different ways. In the implementation, the technique of continuation coding 
turned out to be a very convenient way to express the branchings in these steps of the 
calculation. That is, one passes to the function b that branches the calculation a function f 
which is to be called with an individual branching and does all the remaining processing for 
this branch. ("Passing around the entire rest of the calculation as a function argument.") 
The function f is then called multiple times, once for every branch. The abstract idea of passing 
around code towards the data that have to be processed - like an inverse assembly line that 
transports workers to partially completed pieces - is a recurring pattern in the design 
of the entire system. While it is nice and highly expressive to write the code in such a way, 
dynamic memory allocation (aka 'consing') of functions with partially fixed parameters 
(aka 'closures') is a bit wasteful in terms of efficiency. With a lot more work, some (limited) 
additional efficiency improvement might be gained here. 
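
The continuation style described above may be illustrated by the following minimal Python sketch. It is not the gemstone code (which is written in Objective CaML); the function names and the toy vertex data are assumptions chosen for illustration. The branching function receives "the rest of the calculation" as a closure and calls it once per branch.

```python
def branch_over(choices, continuation):
    """Call `continuation` once per choice; it carries the rest of the calculation."""
    for choice in choices:
        continuation(choice)


def enumerate_type_sequences(class_sequence, types_in_class, on_complete, chosen=()):
    """Enumerate all realizations of a sequence of vertex classes as vertex types."""
    if not class_sequence:
        on_complete(chosen)
        return
    first, rest = class_sequence[0], class_sequence[1:]
    # The lambda is a closure with partially fixed parameters (rest, chosen).
    branch_over(
        types_in_class[first],
        lambda t: enumerate_type_sequences(rest, types_in_class, on_complete, chosen + (t,)),
    )


# Toy data (hypothetical): two vertex classes with two and three vertex types.
types_in_class = {"A": ["a1", "a2"], "B": ["b1", "b2", "b3"]}
results = []
enumerate_type_sequences(["A", "B", "A"], types_in_class, results.append)
```

Here `results` collects the 2 · 3 · 2 = 12 type sequences in depth-first order. Every call of the lambda allocates a fresh closure, which is the allocation cost mentioned in the text.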

A further subtlety comes from the requirement to properly treat signs from fermionic 
loops. This is done by subsequently recording the types of all legs of a vertex 
(fermionic/non-fermionic) in a special bit-vector in the graph (in the implementation, this is distributed 
out over two 30-bit integers) whenever a new vertex is added, and removing the fermionic 
leg bits again whenever two such legs are being fused, introducing an extra minus sign if 
there is an odd number of active fermionic legs whose label lies between the labels of the 
fused legs. (Care must be taken to ensure that one does not apply such modifications to 
the graph data structure in such a way that they are not being undone properly once we 
go back to do a different branch of the calculation.) 
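
A minimal Python sketch of this sign rule follows. The function name and the convention of returning a new bit mask are assumptions made here; Python integers are unbounded, so the split over two 30-bit integers is not needed in this illustration.

```python
def fuse_fermionic(active_mask: int, i: int, j: int):
    """Fuse the fermionic legs labeled i and j.

    `active_mask` has bit k set if leg k is an open fermionic leg.
    Returns (sign, new_mask); the input mask is left untouched.
    """
    lo, hi = min(i, j), max(i, j)
    # Mask of all labels strictly between lo and hi.
    between = ((1 << hi) - 1) & ~((1 << (lo + 1)) - 1)
    crossed = bin(active_mask & between).count("1")
    sign = -1 if crossed % 2 else 1
    new_mask = active_mask & ~((1 << i) | (1 << j))
    return sign, new_mask
```

Returning a fresh mask instead of modifying the old one is one simple way to make sure that nothing has to be undone when the calculation returns to a different branch.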

A final trick is to make use of the property that in every vertex addition step, the only 
legs that may be fused are those that were introduced by the vertex. It may well be that 
this eventually leads to the combination of (at worst) five different connected components 
(the vertex itself and four other components that bind to its four in-going legs), but this 
can be done sequentially, and in every such absorption step, we only have to remember the 
presently 'hot' connected component plus the cell where the vertex originally was absorbed, 
and the positions of the vertex's original still unbound legs in the new cell. As every leg 
can be labeled with a 6-bit integer, and as we at most have four active ('hot') legs at any 
step during binding, it becomes attractive to encode a list of four legs in one 28-bit integer, 
seven bits per leg, with 127 = $1111111_2$ as a special 'end of list' stop code. Implementing 
primitive list operations on such short lists of small integers brings the benefit that no 
expensive memory management is required, plus a lot of work can be done very close 
to the processor's register file. Furthermore, should microprocessor design evolve into the 
highly desirable direction to have certain parts of freely reconfigurable logic on the CPU 
(cf. e.g. [27]) during the next decade, this specific part of the calculation could easily be 
moved directly to the hardware level, so that it could be done at virtually no cost at all. 
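
The packed representation of short leg lists can be sketched as follows in Python. The slot order (first leg in the lowest seven bits) and the function names are assumptions made for this illustration; only the layout of four 7-bit slots with 127 as stop code is taken from the text.

```python
STOP = 127                 # 0b1111111, the 'end of list' code
BITS = 7                   # bits per slot
MAX_LEGS = 4               # at most four active legs
EMPTY = (1 << (BITS * MAX_LEGS)) - 1   # all four slots hold STOP


def pack(legs):
    """Pack up to four 6-bit leg labels into one 28-bit integer."""
    if len(legs) > MAX_LEGS or any(not 0 <= leg < 64 for leg in legs):
        raise ValueError("need at most four labels in the range 0..63")
    word = EMPTY
    for slot, leg in enumerate(legs):
        shift = BITS * slot
        word = (word & ~(STOP << shift)) | (leg << shift)
    return word


def unpack(word):
    """Read the labels back, stopping at the first STOP code."""
    legs = []
    for slot in range(MAX_LEGS):
        entry = (word >> (BITS * slot)) & STOP
        if entry == STOP:
            break
        legs.append(entry)
    return legs


def remove_leg(word, leg):
    """Return the packed list with `leg` removed (e.g. after it was bound)."""
    return pack([entry for entry in unpack(word) if entry != leg])
```

All operations act on a single machine-sized integer, so no memory is allocated for the list itself.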

The term algebra is fully abstract, which means that one may hook arbitrary imple- 
mentations that provide notions of fusing terms and legs into the combinatorial engine. In 
particular, two term algebras that are particularly nice for debugging are the trivial one 
that just records all fusions in a list, and one that produces a graphviz [25] input file for 
visual rendering of individual graphs to check that the algorithm is working correctly. 

A.4 Operator Sequences 

The set of all vertex sequences that have to be applied to the initial chain is determined by 
expanding the effective Hamiltonian (B.1) in terms of vertices of definite energy shift. This 
allows one to shift all projectors and propagators out of the Hamilton operator. The 
corresponding rules are (V± denoting three-legged vertices and Vi denoting four-legged vertices): 

(A.1) 
Thus, the one-loop operator (compare (B.5)) 

is unfolded to a sum of seven terms: 

At four-loop order, the corresponding operator contains 339 summands, which likewise 
unfold to 338 834 contributions (compare (B.8)): 

Every such vertex factor may again be a sum of up to 188 contributions, each one with its own 
leg structure. 

While this is sufficiently easy to do with a small FORM program within about 15 
minutes of calculation time on 1.7 GHz Pentium-IV hardware, we note that here as well 
it is possible with very little effort to speed up this part of the calculation by a 
factor of about 300 by using compiled (e.g. Lisp) code. As propagators and projectors are 
shifted through the term, it makes sense to just record and operate on the positions of all 
these factors that still are present instead of generating new terms even for shifts that only 
introduce an extra factor. One key observation is that there are only 15 different classes 
of vertices, so one can encode a sequence of up to eight vertices very conveniently in a 
32-bit number. (As the authors originally hoped to be able to go even beyond level four, 
and as the Lisp system with which they performed this expansion does not support 64-bit 
numbers, they abused 52-bit floating-point mantissas for this purpose.) These tags for vertex 
sequences are used as hash keys to collect contributions to the various sequences. 
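
As an illustration of this tagging, the following Python sketch packs a sequence of vertex classes into a 32-bit key, four bits per vertex, and collects coefficients in a hash table. Numbering the classes 1..15 (so that 0 can mean "no vertex") and the function names are assumptions made here, not details taken from the original Lisp program.

```python
from collections import defaultdict
from fractions import Fraction


def sequence_key(classes):
    """Pack up to eight class ids (1..15) into a 32-bit integer, 4 bits each."""
    if len(classes) > 8 or any(not 1 <= c <= 15 for c in classes):
        raise ValueError("need at most eight class ids in the range 1..15")
    key = 0
    for c in classes:
        key = (key << 4) | c
    return key


def collect(contributions):
    """Sum the coefficients of all contributions that share a vertex sequence."""
    totals = defaultdict(Fraction)
    for classes, coefficient in contributions:
        totals[sequence_key(classes)] += coefficient
    return dict(totals)
```

Since no class id is zero, sequences of different length never share a key.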
A.5 Term algebra 

Playing around with the graph engine shows that, even for small problems, quite impressive 
amounts of data have to be processed by the term algebra. By the simple observation known 
as Amdahl's law, a program cannot be sped up by orders of magnitude by making much faster 
only a part of the code that originally consumed half of the calculation time. Hence, 
an aggressive implementation of graph combinatorics must be matched 
by an equally aggressive term algebra. 
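
In formulas, the standard statement of Amdahl's law reads as follows, where the symbols $p$ (fraction of the run time that is accelerated), $s$ (acceleration factor of that part) and $S$ (overall speedup) are notation introduced here for illustration:

```latex
S(p, s) = \frac{1}{(1 - p) + \frac{p}{s}},
\qquad
\lim_{s \to \infty} S\left(\tfrac{1}{2}, s\right) = \frac{1}{1 - \tfrac{1}{2}} = 2 .
```

Even an infinitely fast graph engine would thus gain only a factor of two if the term algebra kept consuming the other half of the original run time.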

A.5.1 Rules 

All the terms that occur in the calculation are sums of contributions that consist of a 
rational coefficient, optionally with an extra factor $i$ or $\sqrt{2}$, and a series of further factors 
from the spin(9) algebra of types $\delta_{ab}$, $\epsilon_{ijk}$, $\epsilon_{m_1 \ldots m_9}$, $\gamma^i_{\alpha\beta}$, $\gamma^a_{\alpha\beta}$, $\Pi^\pm_{\alpha\beta}$. Here, $i, j, \ldots$ denote 
SO(3) vector indices, $a, b, \ldots$ denote SO(6) vector indices and $m$, $\alpha$ are spin(9) vector and 
spinor indices for SO(3) x SO(6) $\subset$ spin(9). A projection factor $\Pi^\pm = \frac{1}{2}\left(1 \pm \frac{i}{6}\gamma^{ijk}\epsilon_{ijk}\right)$ 
has to be inserted wherever two fermionic legs are fused, cf. (2.3). 

These projection matrices are shifted through a sequence of $\gamma$ matrices via the rule 
$\gamma^i \Pi^\pm = \Pi^\pm \gamma^i$, $\gamma^a \Pi^\pm = \Pi^\mp \gamma^a$. Note that $(\Pi^\pm)^2 = \Pi^\pm$ and $\Pi^+ \Pi^- = 0$, as advertised for projectors. 

As we work with a SU(2) subgroup sector of SU(4) = Spin(6), we further split the $a$ 
indices via $a \to (W, \bar W, Y, \bar Y, Z, \bar Z)$; we will then be only concerned with initial graphs 
formed from out-going $W$, $Z$ legs. 

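Explicitly, our spin(9) algebra reduction rules are: 

• (delta) $\delta_{q_1 q_2} T_{\ldots q_1 \ldots} = T_{\ldots q_2 \ldots}$, $\delta_{aa} = 6$, $\delta_{ii} = 3$, $\delta_{\alpha\alpha} = 16$, $\delta_{W\bar W} = \delta_{\bar W W} = \delta_{Y\bar Y} = \ldots = 1$ 

• (epsilon) $\epsilon_{ijk}\epsilon_{ilm} = \delta_{jl}\delta_{km} - \delta_{jm}\delta_{kl}$ 

• (Clifford) $\gamma^m_{\alpha\beta}\gamma^n_{\beta\gamma} + \gamma^n_{\alpha\beta}\gamma^m_{\beta\gamma} = 2\delta^{mn}\delta_{\alpha\gamma}$ 

• (epsilon-9) $\delta_{\alpha_1\alpha_{10}} \prod_{k=1}^{9} \gamma^{m_k}_{\alpha_k\alpha_{k+1}} = 16\,\epsilon_{m_1 \ldots m_9}$ 

There would be further (obvious) reduction rules involving e.g. $\epsilon^{(9)} \cdot \epsilon^{(9)}$, but we do not 
have to take care of those, as they cannot occur up to four-loop order. Likewise, we do not 
include rules of the type $\epsilon^{(9)}\epsilon^{(3)} \to \ldots$; the number of such terms is expected to be quite 
limited, and treatment of such special cases may always be deferred to more generic term 
manipulation programs. 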

The Clifford and epsilon-9 rules are not used in the form given here, but to determine 
beforehand a table of all the expansions of traces of gamma matrices in terms of epsilons 
and deltas that can show up in the calculation. 
A.5.2 Strategy 

Aside from some special cases ($\epsilon\epsilon \to \sum \delta\delta$ and $\mathrm{tr}\,\gamma \ldots \gamma \to \ldots$), every graph contributes 
to the coefficient of a single individual summand. Hence, we associate to every connected 
component a tight encoding of a summand which contains information about the coefficient, 
certain extra factors like $i$ or $\sqrt{2}$, as well as $\epsilon_{ijk}$, $\epsilon_{m_1 \ldots m_9}$, $\delta_{mn}$ and $\gamma^m$ algebraic factors 
and indices. Once a contribution has been generated, the new coefficient is added to a hash 
table mapping normalized summands to (references to) coefficients. 

Using conventional symbolic algebra, fusing legs would be performed by generating a 
new summand which is just the old one extended by an extra $\delta_{mn}$ (or $\gamma$) factor and 
then successively searching through all algebra reduction rules to find a place in the new 
term where some such rule can be applied. One can do much better by making use of the 
information that the original terms have been totally reduced and therefore, if any new 
reduction became possible, it has to include the newly added factor. Hence, we mark the 
newly added factor as 'hot' and successively chase it through all possible reduction rules 
that such a hot factor may participate in. Hotness is treated as contagious here: every factor 
that is 'activated' by a hot factor becomes hot itself. 
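
The idea of chasing only the hot factor can be sketched for the simplest case, a product of delta factors. The following Python fragment is a toy model under assumptions made here (index names such as "a1" or "i2" carry their type in the first character, and a term is a coefficient plus a list of index pairs); it is not the encoding used in gemstone.

```python
DIM = {"i": 3, "a": 6}  # traces over SO(3) and SO(6) vector indices


def add_hot_delta(coeff, deltas, hot):
    """Insert a delta into an already reduced product of deltas.

    Only the hot factor is examined: either it closes a trace, or it
    absorbs one existing delta and the result becomes the new hot factor.
    """
    m, n = hot
    while True:
        if m == n:                       # closed trace: pure number
            return coeff * DIM[m[0]], deltas
        partner = next((d for d in deltas if m in d or n in d), None)
        if partner is None:              # nothing to contract: factor cools down
            return coeff, deltas + [(m, n)]
        deltas = [d for d in deltas if d is not partner]
        shared = m if m in partner else n
        free_hot = n if shared == m else m
        free_old = partner[0] if partner[1] == shared else partner[1]
        m, n = free_old, free_hot        # contagion: the merged delta is hot
```

No reduction rule is ever tried on the cold factors, which is the saving compared to a generic rule search.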

In our particular case, such an approach stratifies the algebra in the sense that the 
longest chain of reductions that a newly introduced factor may generate is a hot gamma 
closing a trace which then is expanded to a set of hot deltas, which connect different 
epsilons that originally were connected to the gamma chain to produce another set of hot 
deltas, which may rename open legs (on other $\delta$, $\epsilon$, or $\gamma$ factors). No longer sequence of 
reductions than this $\gamma \to \delta \to \epsilon \to \delta$ chain may occur; in particular, there are no loops in 
our reduction rules that may be performed an unspecified number of times. Thus, at the 
heart of the term algebra lies a set of functions that represent this chain of reductions which 
may be entered at an arbitrary place, depending on the type of factor that has been added. 
As allocating memory for the many intermediate terms that have to be generated while a 
summand runs through this reduction chain would cause an excessive number of expensive 
memory reclaiming steps (garbage collection), we reduce dynamic memory allocation to a 
minimum by first copying summands that have to be reduced to a statically allocated space 
of sufficient size, which we call the 'workbench'; term transformations are directly applied to 
a term's copy on the workbench. The problem that one reduction step may branch into many 
different individual summands is then also easily dealt with: instead of a single workbench 
area, we use multiple such areas, start by copying the summand to area #0, and whenever 
the calculation branches into multiple summands, e.g. via $\epsilon_{ijk}\epsilon_{imn} \to \delta_{jm}\delta_{kn} - \delta_{jn}\delta_{km}$, we 
copy the term that is currently processed in area #N to area #(N + 1) multiple times, 
once for every branch, add appropriate modifications to it, and continue processing with 
the terms in place #(N + 1), one after the other. Eventually, when no more reductions 
can be performed, we create a new summand out of the contents of the workbench area we 
presently work at. 

This is depicted in figure 3. (Admittedly, this description is slightly simplified.) The 
method employed here to chase active parts through a term in order to avoid both 
unnecessary checking whether some reduction rule can be applied as well as dynamical memory 
management is quite general and can be applied in every situation where reductions have to 
be performed on terms of which one knows that they are derived from non-reducible terms 
by a small number of local modifications. Indeed, it should even be possible to implement 
a system that automatically compiles term reduction rules to machine code that employs 
such a 'waterfall' scheme. 

Figure 3: Calculations on terms without dynamic memory management 

The drawback of the decision to branch graph generation to individual summands 
obtained in $\epsilon\epsilon$ and $\mathrm{tr}\,\gamma$ reductions is that some graph generation work is re-done spuriously 
that could be avoided if one attributed sums, not products, to individual graphs. On the 
other hand, this scheme helps cutting some branches in the graph generation (to be dis- 
cussed later). One should notice, however, that a large number of vertices has to be used 
to construct a graph that branches into many summands; as there are only at most eight 
vertices in total, the amount of work that is unnecessarily multiplied is limited to the tips 
of the graph of all possible choices. The most important advantage of this scheme is, of 
course, that it is comparatively easy to implement and debug. 

As a further trick, one notes that besides the coefficients of individual vertex sequences, 
and symmetrization factors 1/6 that come from $\Pi^\pm$, all the denominators that appear in the 
calculation are small powers of two, with the numerators not becoming excessively large. 
One can convince oneself that up to four loops, floating-point calculations are exact on such 
numbers(!), so we can abuse double precision floating-point numbers to represent coefficients 
for a large part of the calculation and handle the other fractional factors not of this form in 
a separate final step. Besides vastly simplified memory management, this especially speeds 
up multiplication of coefficients. 
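
The exactness claim is easy to check on a small scale. The following Python sketch (with sample factors chosen arbitrarily for illustration) multiplies a few coefficients whose denominators are powers of two, once in double precision and once with exact rationals, and also shows that a factor 1/6 is not exactly representable.

```python
from fractions import Fraction


def product_both_ways(factors):
    """Multiply factors num / 2**k as doubles and as exact rationals."""
    as_float = 1.0
    as_fraction = Fraction(1)
    for num, k in factors:
        as_float *= num / 2.0 ** k
        as_fraction *= Fraction(num, 2 ** k)
    return as_float, as_fraction


# Sample coefficients 3/2, -5/8, 7/4, 9/16 (hypothetical values).
float_value, exact_value = product_both_ways([(3, 1), (-5, 3), (7, 2), (9, 4)])

# Fraction(x) converts a double to the exact rational it represents.
dyadic_is_exact = Fraction(float_value) == exact_value
sixth_is_exact = Fraction(1 / 6) == Fraction(1, 6)
```

Here `dyadic_is_exact` is true and `sixth_is_exact` is false, which is why the factors 1/6 have to be handled separately.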

A.5.3 On the Graph <-> Term interplay 

Planar graph generation drives the calculation. Naively, one may consider to just memorize 
all the leg fusions during graph generation and eventually do all of the calculation of the 
term corresponding to a graph in the very last step. Alternatively, one may do as much of 
the calculation as possible in every step throughout the incremental generation of a graph. 
The advantage of the first approach is that no term calculations at all have to be performed 
for such intermediary graphs that cannot lead to a final graph of the desired topology. The 
big disadvantage is that for different final graphs that are derived from some common stem, 
the part of the calculation that corresponds to this stem is done multiple times, once for 
each final graph, although in principle it would have been sufficient to do it only once. 

The big advantage of the second approach is that we may be able to see early that 
some stem cannot give any contribution anymore, e.g. if the coefficient became zero due 
to occurrence of a factor $\epsilon_{iij}$ or a $\Pi^+\Pi^-$ projector combination, and we may save a lot 
of expensive graph generation work for all graphs that are derived from this stem. Unfor- 
tunately, we then have to do lots of spurious term calculations for stems which will not 
produce a suitable final graph. 

Interestingly, it is possible to get the best of both approaches by using so-called lazy 
evaluation (see figure 4): instead of triggering calculations on actual terms, we let the graph 
engine generate promises to do such calculations if needed. Such a 'promise' is basically a 
data structure holding a flag whether the promise was fulfilled already, plus either the value 
of this promise (if the flag says it indeed already was fulfilled), or otherwise a function, plus 
arguments, that, when evaluated, will produce that value. Once such a promise is requested 
to be fulfilled ('forced'), the promise's flag will be set accordingly, and the function and 
arguments will be replaced by the promise's value, so that next time its value is needed, 
one can just take it, without having to call the function again. Promises are comparatively 
cheap to give - this only requires a small amount of consing. Conceptually, the idea behind 
such promises is quite similar to the idea behind stock options. In practical applications, 
a 'promise' will typically promise to first require some other promises it depends on to be 
fulfilled, and use their values to compute its own value. 
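
A promise of this kind can be written down in a few lines. The sketch below is in Python for illustration (Objective CaML provides lazy values natively); the class and the toy 'terms', which are plain numbers multiplied along a stem, are assumptions made here.

```python
class Promise:
    """A delayed computation that is evaluated at most once."""

    def __init__(self, func, *args):
        self._func, self._args = func, args
        self.fulfilled = False
        self._value = None

    def force(self):
        if not self.fulfilled:
            self._value = self._func(*self._args)
            self.fulfilled = True
            self._func = self._args = None   # keep only the value
        return self._value


calls = []  # records which factors were actually multiplied in


def times(parent, factor):
    """Toy term calculation: force the parent's term, then extend it."""
    calls.append(factor)
    return parent.force() * factor


root = Promise(lambda: 1)
stem = Promise(times, root, 0)      # this stem will turn out to be zero
leaf_a = Promise(times, stem, 5)
leaf_b = Promise(times, stem, 7)
value_a = leaf_a.force()            # forces stem and root on the way
value_b = leaf_b.force()            # reuses the stem's stored value
```

After `leaf_a` has been forced, `stem.fulfilled` is set and its value is known to be zero, so a graph generator could skip all further graphs derived from this stem; the stem's term is computed only once.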

Here, we only carry promises for terms which depend on other promises for terms 
through the depth-first graph generation until we actually do generate a final graph. No 
expensive term calculations have been done so far, especially not for futile partial graphs 
that cannot be completed to a topologically interesting graph. Then, we take the graph's 
promise for a term and force it. It may be that this is zero, due to some promise in the stem 
producing a zero result, and we did some small amount of unnecessary work in generating 
this graph. However, when depth-first graph generation proceeds to the other branches that 
share a stem whose term is zero with the present graph's genealogy, we see that the promise 
for the term in the stem already has been fulfilled and we can cut generating further graphs 
from this particular stem. 

One subtlety lies in the treatment of fermionic signs: the determination of signs from 
fusion of fermionic legs has to be taken out of the promise, as one has little control over the 
point in time when a promise is being fulfilled, and the state of the flags denoting which 
legs are unbound fermionic ones may be a very different one from the state of those flags at 
the time when the promise was given (which is the proper state to base sign decisions on). 
Figure 4: On the value of lazy evaluation: Every node corresponds to a partially constructed graph 
and its associated (lazy) term. The root node is just the initial domain. Every link leads from a 
graph to a graph which has been extended by one more vertex. In (#1), no algebraic calculations 
are performed, as graph generation gets stuck with partial graphs that cannot be completed to final 
planar graphs. In (#2), a planar graph could be generated, and term algebra forces all the promises 
for calculations down to the root to be fulfilled (bold lines). However, the result is zero, coming from 
a zero at level two that is discovered in this step only. All the other graphs and partial graphs that 
would also derive from this node are then not even generated, as the 'known zero value' information 
is used in graph generation. In (#3), we get final nonzero contributions. Note furthermore that 
intermediary results down the path to the root are only calculated once! 
A.5.4 Implementation 

Due to memory access being a major bottleneck in today's CPUs, and the hierarchical 
structure of caches in computer systems, one should take some care not to unnecessarily 
waste space with data representations. However, one can over-optimize here - tight data 
representations frequently come at the expense of more code. In the present implementation, 
we use a single 8-bit byte (char) to represent legs within the term algebra. The most 
significant bit encodes whether a leg is of type ijk or abc, the other bits encode the 'name' 
of the leg (a number). This works because fermionic $\alpha\beta\gamma$ legs can only occur at the ends of 
chains of gamma matrices, which are stored in special places. Leg type information is needed 
in two places: to determine which extra number factor to introduce when reducing $\delta_{mm}$, and 
to determine whether a sign flip has to occur when the rule $\gamma^a \Pi^\pm = \Pi^\mp \gamma^a$, $\gamma^i \Pi^\pm = \Pi^\pm \gamma^i$ 
is used. Every chain of gamma matrices in a term is encoded as a string (vector of bytes), 
where bytes 0 and 1 denote the names of the fermionic ends; the high bits of these two bytes 
encode whether the chain has an extra factor $\Pi^\pm$ at the left end as well as its particular 
type. All delta factors are likewise encoded in a string holding pairs of bytes. Epsilon factors 
$\epsilon_{ijk}$ combine three indices, hence we use one 30-bit signed integer per factor, while we use a 
string to hold nine indices for every $\epsilon_{m_1 \ldots m_9}$ factor. As fermionic legs are just those appearing 
inside gamma chains, hence easy to discern from SO(3) ijk and SO(6) abc legs, we only 
have to discern the latter two types. This we do by using the convention that all legs given 
a number in the range 0 ... 63 (i.e. with non-set sixth bit) are of type a, while those in the 
range 64 ... 126 are of type i. 
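
The label convention of this paragraph can be summarized in a short Python sketch. The function names are assumptions made here; the ranges, the special value 127 and the trace values 6 and 3 are those quoted in the text.

```python
NO_INDEX = 127  # reserved: denotes 'no index' in graph generation


def leg_type(label: int) -> str:
    """Return 'a' for SO(6) labels (0..63) and 'i' for SO(3) labels (64..126)."""
    if not 0 <= label <= 126:
        raise ValueError("label must lie in the range 0..126")
    return "a" if label < 64 else "i"


def delta_trace(label: int) -> int:
    """Number factor produced when a delta is traced over this index."""
    return 6 if leg_type(label) == "a" else 3


def encode_deltas(pairs) -> bytes:
    """Store a product of delta factors as a byte string of index pairs."""
    return bytes(label for pair in pairs for label in pair)
```

For example, `encode_deltas([(3, 5), (64, 70)])` yields a four-byte string, two bytes per delta factor.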

On the workbench, we use extra-long arrays to hold these data and fill pointers remem- 
bering how many entries are momentarily valid. Furthermore, on the workbench, we have 
to hold additional information about 'hot' factors. A further subtlety concerns two special 
types of indices: first, we get ijk indices not introduced by the legs of some vertex: we 
get these from the $\gamma^{ijk}\epsilon_{ijk}$ contribution to $\Pi^\pm$ when closing a gamma trace. Here, we take 
indices from the end of the numerical range (with 127 being treated as special, as it also 
denotes 'no index' in graph generation) 126, 125, ..., remembering how many such factors 
we already introduced for the present graph. Second, as we want to split the a index to 
$Z, \bar Z, W, \bar W, Y, \bar Y$, we let indices in the range 58 ... 63 denote these special indices. As these 
split index ranges are one-dimensional, they do not provide special handles associating them 
with parts of the term; we do not need them here anymore. 
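
The numbering conventions just described can be summarized in a few lines. The following
Python sketch is purely illustrative and is not taken from gemstone (which is written in
Objective Caml): all function and class names are hypothetical, the assignment of the six
values 58 ... 63 to $Z, \bar Z, W, \bar W, Y, \bar Y$ in this particular order is an assumption, and so
is the reading of the 'extra number factor' as the dimension of the index range (3 for SO(3),
6 for SO(6)).

```python
NO_INDEX = 127                       # reserved: also means "no index" in graph generation
SPLIT_FIRST, SPLIT_LAST = 58, 63     # six split SO(6) components
SPLIT_LABELS = ("Z", "Zbar", "W", "Wbar", "Y", "Ybar")   # assumed order


def leg_type(name):
    """Return 'a' (SO(6), names 0..63) or 'i' (SO(3), names 64..126)."""
    if not 0 <= name <= 126:
        raise ValueError(f"{name} is not a valid leg name")
    return "i" if name & 0x40 else "a"   # bit with value 64 distinguishes the ranges


def split_label(name):
    """Return the split SO(6) component for names 58..63, else None."""
    if SPLIT_FIRST <= name <= SPLIT_LAST:
        return SPLIT_LABELS[name - SPLIT_FIRST]
    return None


def delta_trace(name):
    """Assumed factor from contracting a delta with itself: 3 for 'i', 6 for 'a'."""
    return 3 if leg_type(name) == "i" else 6


class TraceIndexAllocator:
    """Hands out fresh 'i'-type names 126, 125, ... for one graph."""

    def __init__(self):
        self.used = 0

    def fresh(self):
        name = 126 - self.used
        if name < 64:
            raise OverflowError("ran out of 'i'-type names")
        self.used += 1
        return name

    def reset(self):
        """Call when starting a new graph."""
        self.used = 0
```

With these definitions, `leg_type(63)` is `'a'`, `leg_type(64)` is `'i'`, and three successive
calls to `fresh()` on a new allocator give 126, 125, 124.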

A.6 On the Calculation

The size of this calculation is larger by orders of magnitude than any other symbolic compu- 
tation the authors are aware of. Such jumps in complexity are quite generically accompanied 
by the discovery of new effects that did not occur on smaller scales. In this particular case, 
removing flaws from the process of running such large calculations eventually turned out 
to be more demanding than removing bugs in the code. The main reasons were: 

1. Without a dedicated supercomputer time budget for this project, computation time
had to be borrowed from a variety of different sources.

2. The job queuing systems used at some sites turned out to be well adapted to the
needs of numerical simulations, but not to those of symbolic computations.

3. Some especially hard parts of the calculation had to be split multiple times.

4. Perturbation theory calculations are notoriously hard to debug if the only sign of an
error is that the overall sum of data from $O(10\,000)$ individual files, which take many
CPU-years to produce, is nonsense.

All in all, performing the actual calculation took about half a year.

A.6.1 Performance

Profiling a three-loop sample calculation shows that with all of these tricks incorporated,
the code spends most of the time in graph generation, but there is no single hot spot where
further optimization may be useful. One might wonder whether graph generation could be
sped up further by adding more heuristic checks that determine early, given a set of
remaining vertices that have to be added to a partially finished graph, whether there will
be any way to eventually produce a graph with only one connected component carrying
exactly one cell with open legs. The idea is that as soon as situations occur like legs of
type A and B, which will not be bound anywhere anymore, going to different cells, one may
stop graph generation early. Experiments have shown that this is an important optimization
if the term algebra is so expensive that it slows down partial graph generation. As long as
there is only very little dynamic memory management going on inside the graph engine,
however, the extra effort to do such early checks is large enough to destroy any advantage
of this approach. Again, this situation would change should reconfigurable logic become
available on mainstream CPUs. Then, one might be able to perform such checks at very
little extra cost.

Table 2: Sizes of various calculations

In table 2 we display the sizes of some calculations, done in FORM and with gemstone.
Note that one should not try to naively deduce a statement about the relative quality of
these symbolic manipulation systems from such timings, as very different algorithms are
used for the different systems. Despite the not unimpressive number of non-zero
contributions (roughly, the number of diagrams), one should not be misled by them: the
hard part of the work lies in generating graphs, the vast majority of which turn out to give
no contribution. This can be glimpsed from the fact that the average gemstone term
generation speed drops from about 640/s for ZZ at three loops to about 20/s for the
length-eleven chain at four loops, but can be as high as 1050/s at three loops or even 5100/s
at four loops for individual sequences. If we assume that these 'fluctuations' in term
generation speed are roughly in the ballpark of the square root of the number of total
(mostly non-permissible) graphs per non-vanishing term, we may wildly guess that the total
length-eleven chain calculation at four-loop order would have had something like half a
quadrillion possible Feynman graphs. At least, it is a funny coincidence to note that by
doing the estimate this way from a 'measurement', one arrives at the same number
($4.3 \cdot 10^{14}$) which is obtained by just multiplying the naive number of possible combinations
for the four-times-four-legs sequence for the largest vertex class ($188^4$) with the number of
different vertex sequences (338 834). As this second estimate does not take such factors as
the initial chain length, multiple ways to add one vertex to a graph, and variations in
complexity between vertex sequences into account, this is indeed only a coincidence.
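
The naive count quoted above is a plain product and can be reproduced in a few lines. The
Python sketch below only multiplies the two numbers given in the text; the variable names
are ours, the interpretation of 188 as the number of combinations per four-leg vertex is an
assumption, and nothing here models the actual graph generation.

```python
# Numbers quoted in the text.
combinations_per_vertex = 188   # assumed: naive combinations for one four-leg vertex
vertices_in_sequence = 4        # the four-times-four-legs sequence
vertex_sequences = 338_834      # number of different vertex sequences

naive_graphs = combinations_per_vertex ** vertices_in_sequence * vertex_sequences
print(f"naive number of graphs: {naive_graphs:.1e}")
```

The product comes out at a few times $10^{14}$, i.e. of the order of half a quadrillion.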

A.6.2 Distributed Computing

Evidently, such four-loop calculations are far too demanding to be performed on a single 
machine. Fortunately, they naturally decompose into many individual independent parts - 
one for each sequence of vertices - and hence are trivially parallelizable. (One may wonder 
how much time is wasted by doing graph generation over and over again for vertex sequences 
that share a long common beginning. With reasonable assumptions, one can estimate this 
spurious effort to contribute only at most a few per-cent to total calculation time.) 

Profiling a three-loop calculation shows that while some vertex sequences can be calculated
much faster than others, there is no clear jump in the distribution of calculation times of
individual pieces. Nevertheless, about 50% of the work is done for 0.5% of all vertex
sequences, while 90% of all sequences take only 10% of the time. This is shown in figure 5.
Figure 5: Accumulated calculation time vs. part of sequences for ZZ at three-loop order

It is tempting to try to provide additional mechanisms to further parallelize work done
on individual chains, and the nature of the calculation with its many branchings suggests
quite directly how one may achieve this. As we do not know a priori where the hot spots
are, we want to be able to signal a running calculation to break itself up. So far, this is
only supported very rudimentarily: if a running calculation process receives an alarm signal
(SIGALRM, either sent externally or set as an alarm timeout via a command-line option),
the signal handler sets a flag which tells the graph engine to record all work that has to be
done from that instant on in a list of deferred calculations, which is then serialized,
gzip-compressed, base64-encoded, and written to disk (see footnote 13). (One may regard
this as an ad-hoc approach to serializing a continuation.) The gemstone program is able to
split a serialized calculation into multiple pieces of (hopefully) similar size.
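
The mechanism can be illustrated with a small Python sketch. It is not the gemstone
implementation: the function names, the depth-first work stack, the round-robin split, and
the use of JSON as the serialization format (instead of a language-native serializer) are
all assumptions made for the sake of a self-contained example. The handler only sets a flag,
which the engine polls at safe points.

```python
import base64
import gzip
import json
import signal

# Flag polled by the engine; the handler does nothing but set it.
state = {"split_requested": False}


def on_alarm(signum, frame):
    """SIGALRM handler: request that all remaining work be deferred."""
    state["split_requested"] = True


def install_alarm(timeout_seconds=0):
    """Register the handler and optionally arm a timeout (Unix only)."""
    signal.signal(signal.SIGALRM, on_alarm)
    if timeout_seconds:
        signal.alarm(timeout_seconds)


def run(tasks, expand, evaluate):
    """Walk a branching computation depth-first.

    expand(task) returns the sub-tasks of a branch (an empty list for a leaf),
    evaluate(leaf) returns its contribution. Once the flag is set, every work
    item still on the stack is recorded instead of processed.
    """
    total, deferred, stack = 0, [], list(tasks)
    while stack:
        task = stack.pop()
        if state["split_requested"]:
            deferred.append(task)
            continue
        children = expand(task)
        if children:
            stack.extend(children)
        else:
            total += evaluate(task)
    return total, deferred


def dump_deferred(deferred):
    """Serialize, gzip-compress and base64-encode the deferred work list."""
    raw = json.dumps(deferred).encode("utf-8")
    return base64.b64encode(gzip.compress(raw)).decode("ascii")


def load_deferred(text):
    """Inverse of dump_deferred."""
    return json.loads(gzip.decompress(base64.b64decode(text)))


def split_deferred(deferred, pieces):
    """Split a deferred list round-robin into pieces of similar length."""
    return [deferred[k::pieces] for k in range(pieces)]
```

A driver would call `install_alarm(timeout)` once, run the engine, add `total` to the partial
result, and write `dump_deferred(deferred)` to disk if the list is non-empty.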

As serialization of a running computation is a comparatively hairy action, we also
implemented a much simpler fallback method which involves splitting vertex classes into
two parts. After identification of those pieces of the calculation that are hard to do, a
complete expansion of all such $N$ vertex sequences of length $L$ into $N \cdot 2^L$ parts is indeed
sufficient to bring the calculation time of every individual piece down to less than one week.
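
The counting behind the $N \cdot 2^L$ parts can be made explicit with a short Python sketch. It
assumes, for illustration only, that a vertex class is simply a list of vertices and that it
is split into a first and a second half; the function names are hypothetical.

```python
from itertools import product


def split_class(vertex_class):
    """Split one vertex class (a list of vertices) into two halves."""
    mid = (len(vertex_class) + 1) // 2
    return vertex_class[:mid], vertex_class[mid:]


def expand_sequence(sequence):
    """Expand a sequence of L vertex classes into 2**L sub-sequences.

    Each sub-sequence picks one half of every class, so that the pieces
    together cover every combination of vertices exactly once. A class with
    a single vertex yields an empty second half, i.e. an empty piece.
    """
    halves = [split_class(vertex_class) for vertex_class in sequence]
    return [list(choice) for choice in product(*halves)]
```

For a sequence of two classes, `expand_sequence([["a", "b"], ["c", "d", "e"]])` returns four
pieces, each of which can be handed to a separate machine.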

A.6.3 Security 

So far, the gemstone system has only been used in controlled environments. Technically,
it would be easy to extend the system in such a way that donated computer time from
individual volunteers can be employed; the source already contains some first support for
exchanging data via HTTP requests directly from the client. This raises some concerns
regarding both the credibility of the overall result and the security of participants' and
organizers' computers. First of all, when even a single machine produces an erroneous
partial result, be it out of mischief or hardware unreliability (e.g. due to bad RAM or CPU
overclocking), this will not be noticed until the final result is available. Hence, if such a
scheme is feasible at all, then at least every part of the calculation would have to be
done twice by independent volunteers. Second, the problem of handling unexpectedly large
pieces of the calculation has to be addressed. While it would be possible to let a client
machine produce an image of the current state of its calculation and send it back so that it
can be broken up and sent to further volunteers, this must not be permitted. Besides the
occasionally prohibitive size of serialized calculations, a major concern is that the organizer
of such a distributed calculation has little or no control over internal safety checks performed
by the serializer. Thus, a mischievous participant may send in a specially manipulated data
file which, upon de-serialization, may corrupt the Ocaml heap and in the worst case lead to
arbitrary code execution for a different volunteer. The perhaps safest way to address this
problem is to accept some redundant work and recursively decompose any large problem
for which a client sends the information that it cannot be done within a given time-frame
into smaller sub-problems by splitting vertex classes. This also has the big advantage that
one may exert full control over the locations of these splits, which is hardly possible if they
are mostly controlled by a timeout.

13 There are indications of problems with some versions of the Ocaml serializer, so this feature should be
used with care.
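
Both safeguards, duplicated work and recursive decomposition, are simple to express. The
following Python sketch is an illustration under our own assumptions and not part of
gemstone: results are taken to be dictionaries keyed by a piece identifier, and the
callbacks `fits_time_budget` and `split` are hypothetical stand-ins for the client's report
and for the splitting of vertex classes.

```python
_MISSING = object()


def cross_check(results_a, results_b):
    """Return the ids of pieces on which two independent runs disagree.

    A piece that is present in only one of the two runs counts as a
    disagreement and has to be recomputed as well.
    """
    piece_ids = set(results_a) | set(results_b)
    return sorted(
        pid for pid in piece_ids
        if results_a.get(pid, _MISSING) != results_b.get(pid, _MISSING)
    )


def decompose(piece, fits_time_budget, split):
    """Recursively split a piece until every part fits the time budget.

    split(piece) must return strictly smaller sub-pieces, otherwise the
    recursion does not terminate.
    """
    if fits_time_budget(piece):
        return [piece]
    parts = []
    for sub_piece in split(piece):
        parts.extend(decompose(sub_piece, fits_time_budget, split))
    return parts
```

Only pieces on which both runs agree would enter the final sum; all others are handed out
again.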

A.6.4 On the Code 

While the main program is written in Objective Caml for portability, some of the Ocaml 
source files are generated by Perl and Lisp, especially if they had to be translated from 
Mathematica. (For some pieces for which speed is not essential, the machine generated 
code is quite ad-hoc and certainly not as efficient as it might be.) Thus, building the entire 
system will unfortunately require quite a large software development infrastructure. 

As this system is work in progress, there are quite some places where the code is just 
as clean as it has to be in order to be maintainable; depending on further applications of 
this codebase, it may or may not become desirable to do some extensive re-structurings 
especially of the documentation in order to enhance readability. 

A.7 Five Loops

Can this head-on approach be used to do the five-loop calculation? One does not have
to consider initial chains of length greater than 13 in order to fix the five-loop dilatation
operator. As such a calculation may introduce at most nine extra indices from three $\epsilon_{ijk}$ via
fermion loops of odd length, we would have to deal with at most $3 \cdot 10 + 9 + 13 = 52$ indices,
well below the built-in maximum of 63 in gemstone. Furthermore, if one is interested only
in the result and does not care about final states with fermions, which will cancel in the
total sum anyway, small changes to gemstone that drop these terms would make it run
with a very small memory footprint, hence maybe even suited for networked video game
consoles.

While gemstone is not only considerably faster than any other method used so far, but
also scales better, the relative computational effort for three loops is 3 000 times larger
than for two loops, and the effort for four loops is again larger by a factor of 60 000. So it
is perhaps not unreasonable to estimate that the size of the five-loop calculation should be
at least one million times larger than the largest ones presented here. Hence, one would
probably need something in the range of 100-500 million fast machines to do it. This is
roughly in the ballpark of the total technical capability of mankind today.
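
One way to arrive at a factor of this size is sketched below in Python. The extrapolation
rule is our own assumption and is not stated in the text: we suppose that the step factor
from one loop order to the next keeps growing by the same ratio. Only the two quoted
factors enter.

```python
step_2_to_3 = 3_000     # quoted: effort at three loops relative to two loops
step_3_to_4 = 60_000    # quoted: effort at four loops relative to three loops

growth = step_3_to_4 / step_2_to_3     # ratio between successive step factors
step_4_to_5 = step_3_to_4 * growth     # assumed geometric continuation

print(growth)        # 20.0
print(step_4_to_5)   # 1200000.0
```

Under this assumption the five-loop calculation would be about $1.2 \cdot 10^6$ times larger than
the four-loop one, in line with the 'at least one million' quoted above.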
A.8 Results



For the sake of reproducibility of our methods and findings, we include raw results of all 
the large four-loop computer calculations, as well as for the calculations determining the 
dilatation operator at two and three loops. 



WWWWZZZ at two loops:

    WWWWZZZ        -44
    WWWZWZZ         26
    WWWZZWZ         26
    WWZWWZZ         -8                                        (A.5)

WWZWZWZ at three loops:

    WWWWZZZ         16
    WWWZWZZ       -612
    WWWZZWZ       -612
    WWZWWZZ      -1464
    WWZWZWZ       2672                                        (A.6)

ZZ at four loops:                                             (A.7)

WZWZ at four loops:

    WWZZ         88224
    WZWZ        -88224                                        (A.8)

WWZZ at four loops:

    WWZZ        -44112
    WZWZ         44112                                        (A.9)

WWZWZZ at four loops:

    WZWZWZ       22291
    WWWZZZ       12803
    WWZWZZ      -73356
    WWZZWZ       38262                                        (A.10)
WWWZZZZZWWZ at four loops:

    WWWWWZZZZZZ      -3448
    WWWWZWZZZZZ      -5640
    WWWWZZWZZZZ         52
    WWWWZZZWZZZ         52
    WWWWZZZZWZZ      -1000
    WWWWZZZZZWZ      22399
    WWWZWWZZZZZ      22615
    WWWZWZWZZZZ         24
    WWWZWZZZZWZ        -80
    WWWZZWWZZZZ         76
    WWWZZWZZWZZ        -32
    WWWZZWZZZWZ        928
    WWWZZZWWZZZ        -28
    WWWZZZWZWZZ        376
    WWWZZZWZZWZ      -5012
    WWWZZZZWWZZ      -2968
    WWWZZZZWZWZ      21819
    WWWZZZZZWWZ     -63260
    WWZWWZWZZZZ      22195
    WWZWWZZWZZZ      -5064
    WWZWWZZZWZZ        928
    WWZWWZZZZWZ       -104
    WWZWZWWZZZZ      -5168
    WWZWZWZWZZZ        432
    WWZWZWZZWZZ        -32
    WWZWZZWWZZZ        -28
    WWZWZZZWWZZ        -32                                    (A.11)



This last result receives contributions from individual sequences such as

WWWZZZZWZWZ 3747744 

WWWWZZZZZWZ 3747744 

WWWZWWZZZZZ 3747744 

WWZWWZWZZZZ 3747744 

WWWZZZZZWWZ -292115824 



(from the sequence -£Vi )+ iVi -iT4,+i^2,+o^i,+i^i ,-iVi _i) or 

WWWZZZZZWWZ 2673/560 

(from the sequence — q^o1 / i,+31 z l,+iVi,+3V2,+iVi i _3Vi ) _3Vi i _2)- Hence the fact that the re- 
sult contains only small, allowed, integer contributions should be regarded as a highly 
nontrivial first indication of its correctness. 
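
Such a check is easy to automate if the contributions of all vertex sequences are
accumulated in exact rational arithmetic. The Python sketch below illustrates the idea;
it is not the procedure actually used, the state labels `S1`, `S2` are placeholders, and the
second contribution in the usage example is invented so that the totals come out integer.

```python
from fractions import Fraction


def accumulate(contributions):
    """Sum per-sequence contributions exactly, keyed by final state."""
    totals = {}
    for per_sequence in contributions:
        for state, value in per_sequence.items():
            totals[state] = totals.get(state, Fraction(0)) + Fraction(value)
    return totals


def non_integer_states(totals):
    """Return the states whose accumulated coefficient is not an integer."""
    return sorted(s for s, v in totals.items() if v.denominator != 1)


# Usage with made-up data: two sequences whose large or fractional
# contributions combine to small integers.
example = accumulate([
    {"S1": Fraction(2673, 560), "S2": 3747744},
    {"S1": Fraction(-993, 560), "S2": -3747692},   # hypothetical partner values
])
print(example["S1"], example["S2"], non_integer_states(example))
```

A non-empty list returned by `non_integer_states` would point to a missing or corrupted
piece of the calculation.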
B Perturbation expansion of effective Hamiltonian

The effective Hamiltonian acting on the su(2) subsector is given by a perturbative expansion

$$ T = \sum_{k=0}^{\infty} T_{2k} \quad \text{with} \quad T_{2k} \propto \ldots , \tag{B.1} $$

where $T_{2k}$ is called the $k$-loop contribution. As building blocks we need an operator $P_E$
which projects onto the subspace of states with free energy $E$, i.e.

$$ P_E |F\rangle = \delta_{EF} |F\rangle \quad \text{for an eigenstate } |F\rangle \text{ of } H_0 \text{ with eigenvalue } F \tag{B.2} $$

and a "propagator"

$$ \Delta_E = \sum_{F \neq E} \frac{P_F}{E - F} . \tag{B.3} $$

With these definitions, the terms in (B.1) read as follows:



T = H (B.4) 
T 2 = p E [^iA B Fi + V 2 ] P E (B.5) 

E 

T A = Y, p e[ViA e V 1 A e V 1 A e V 1 

E + ViA^A^ + ViA^A^i + V 2 ^eVx^ E Vx + V 2 A E V 2 

-VxAIVxPeT^Pe (B.6) 

T 6 = ^2p e [ViAeViAeViAeViAeViAeVi 

E + V 1 A e V 1 AeV 1 AeV 1 AeV 2 + V 1 AeV 1 AeV 1 A e V 2 AeV 1 
+ ViA^ViAsV^AsViAsVi + V 1 A e V 2 AeV 1 AeV 1 A e V 1 
+ VaA^A^A^A^ 

+ V 1 A E V 1 A E V 2 A E V 2 + V 1 A E V 2 A E V 1 A E V 2 + ViA^V^A^V^A^Vl 
+ V 2 AeViAeViAeV 2 + F 2 A B ViA B F 2 A B Vi + V 2 AeV 2 AeV\AeV\ 
+ y 2 A s V2A s V2 

- (^Al^A^iA^i + v x a e VxA\v x a e Vx + ^a^a^aI^ 

+ VxA e VxAeV 2 + VxAlViAEVx + F 2 A|FiA s Vi 

+ v x a e ViA e v 2 + yiA s y 2 A|y! + v 2 AeV\A e Vi 

+ V 2 A 2 E V 2 )P E T 2 

+ V X A\V X P E T 2 P E T 2 - V X A E V X P E T 4 ] P E (B.7) 
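$$
\begin{aligned}
T_8 = \sum_E P \big[ & V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_1 \\
& + V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_2 + V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_2 \Delta V_1 \\
& + V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_2 \Delta V_1 \Delta V_1 + V_1 \Delta V_1 \Delta V_1 \Delta V_2 \Delta V_1 \Delta V_1 \Delta V_1 \\
& + V_1 \Delta V_1 \Delta V_2 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_1 + V_1 \Delta V_2 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_1 \\
& + V_2 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_1 \\
& + V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_2 \Delta V_2 + V_1 \Delta V_1 \Delta V_1 \Delta V_2 \Delta V_1 \Delta V_2 + V_1 \Delta V_1 \Delta V_1 \Delta V_2 \Delta V_2 \Delta V_1 \\
& + V_1 \Delta V_1 \Delta V_2 \Delta V_1 \Delta V_1 \Delta V_2 + V_1 \Delta V_1 \Delta V_2 \Delta V_1 \Delta V_2 \Delta V_1 + V_1 \Delta V_1 \Delta V_2 \Delta V_2 \Delta V_1 \Delta V_1 \\
& + V_1 \Delta V_2 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_2 + V_1 \Delta V_2 \Delta V_1 \Delta V_1 \Delta V_2 \Delta V_1 + V_1 \Delta V_2 \Delta V_1 \Delta V_2 \Delta V_1 \Delta V_1 \\
& + V_1 \Delta V_2 \Delta V_2 \Delta V_1 \Delta V_1 \Delta V_1 + V_2 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_2 + V_2 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_2 \Delta V_1 \\
& + V_2 \Delta V_1 \Delta V_1 \Delta V_2 \Delta V_1 \Delta V_1 + V_2 \Delta V_1 \Delta V_2 \Delta V_1 \Delta V_1 \Delta V_1 + V_2 \Delta V_2 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_1 \\
& + V_1 \Delta V_1 \Delta V_2 \Delta V_2 \Delta V_2 + V_1 \Delta V_2 \Delta V_1 \Delta V_2 \Delta V_2 + V_1 \Delta V_2 \Delta V_2 \Delta V_1 \Delta V_2 \\
& + V_1 \Delta V_2 \Delta V_2 \Delta V_2 \Delta V_1 + V_2 \Delta V_1 \Delta V_1 \Delta V_2 \Delta V_2 + V_2 \Delta V_1 \Delta V_2 \Delta V_1 \Delta V_2 \\
& + V_2 \Delta V_1 \Delta V_2 \Delta V_2 \Delta V_1 + V_2 \Delta V_2 \Delta V_1 \Delta V_1 \Delta V_2 + V_2 \Delta V_2 \Delta V_1 \Delta V_2 \Delta V_1 \\
& + V_2 \Delta V_2 \Delta V_2 \Delta V_1 \Delta V_1 \\
& + V_2 \Delta V_2 \Delta V_2 \Delta V_2 \\
& - \big( V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta^2 V_1 + V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta^2 V_1 \Delta V_1 + V_1 \Delta V_1 \Delta V_1 \Delta^2 V_1 \Delta V_1 \Delta V_1 \\
& \quad + V_1 \Delta V_1 \Delta^2 V_1 \Delta V_1 \Delta V_1 \Delta V_1 + V_1 \Delta^2 V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta V_1 \\
& \quad + V_1 \Delta V_1 \Delta V_1 \Delta V_1 \Delta^2 V_2 + V_1 \Delta V_1 \Delta V_1 \Delta V_2 \Delta^2 V_1 + V_1 \Delta V_1 \Delta V_1 \Delta^2 V_1 \Delta V_2 \\
& \quad + V_1 \Delta V_1 \Delta V_1 \Delta^2 V_2 \Delta V_1 + V_1 \Delta V_1 \Delta V_2 \Delta V_1 \Delta^2 V_1 + V_1 \Delta V_1 \Delta V_2 \Delta^2 V_1 \Delta V_1 \\
& \quad + V_1 \Delta V_1 \Delta^2 V_1 \Delta V_1 \Delta V_2 + V_1 \Delta V_1 \Delta^2 V_1 \Delta V_2 \Delta V_1 + V_1 \Delta V_1 \Delta^2 V_2 \Delta V_1 \Delta V_1 \\
& \quad + V_1 \Delta V_2 \Delta V_1 \Delta V_1 \Delta^2 V_1 + V_1 \Delta V_2 \Delta V_1 \Delta^2 V_1 \Delta V_1 + V_1 \Delta V_2 \Delta^2 V_1 \Delta V_1 \Delta V_1 \\
& \quad + V_1 \Delta^2 V_1 \Delta V_1 \Delta V_1 \Delta V_2 + V_1 \Delta^2 V_1 \Delta V_1 \Delta V_2 \Delta V_1 + V_1 \Delta^2 V_1 \Delta V_2 \Delta V_1 \Delta V_1 \\
& \quad + V_1 \Delta^2 V_2 \Delta V_1 \Delta V_1 \Delta V_1 + V_2 \Delta V_1 \Delta V_1 \Delta V_1 \Delta^2 V_1 + V_2 \Delta V_1 \Delta V_1 \Delta^2 V_1 \Delta V_1 \\
& \quad + V_2 \Delta V_1 \Delta^2 V_1 \Delta V_1 \Delta V_1 + V_2 \Delta^2 V_1 \Delta V_1 \Delta V_1 \Delta V_1 \\
& \quad + V_1 \Delta V_1 \Delta V_2 \Delta^2 V_2 + V_1 \Delta V_1 \Delta^2 V_2 \Delta V_2 + V_1 \Delta V_2 \Delta V_1 \Delta^2 V_2 + V_1 \Delta V_2 \Delta V_2 \Delta^2 V_1 \\
& \quad + V_1 \Delta V_2 \Delta^2 V_1 \Delta V_2 + V_1 \Delta V_2 \Delta^2 V_2 \Delta V_1 + V_1 \Delta^2 V_1 \Delta V_2 \Delta V_2 + V_1 \Delta^2 V_2 \Delta V_1 \Delta V_2 \\
& \quad + V_1 \Delta^2 V_2 \Delta V_2 \Delta V_1 + V_2 \Delta V_1 \Delta V_1 \Delta^2 V_2 + V_2 \Delta V_1 \Delta V_2 \Delta^2 V_1 + V_2 \Delta V_1 \Delta^2 V_1 \Delta V_2 \\
& \quad + V_2 \Delta V_1 \Delta^2 V_2 \Delta V_1 + V_2 \Delta V_2 \Delta V_1 \Delta^2 V_1 + V_2 \Delta V_2 \Delta^2 V_1 \Delta V_1 + V_2 \Delta^2 V_1 \Delta V_1 \Delta V_2 \\
& \quad + V_2 \Delta^2 V_1 \Delta V_2 \Delta V_1 + V_2 \Delta^2 V_2 \Delta V_1 \Delta V_1 + V_2 \Delta V_2 \Delta^2 V_2 + V_2 \Delta^2 V_2 \Delta V_2 \big) P T_2 \\
& + \big( V_1 \Delta V_1 \Delta V_1 \Delta^3 V_1 + V_1 \Delta V_1 \Delta^2 V_1 \Delta^2 V_1 + V_1 \Delta V_1 \Delta^3 V_1 \Delta V_1 + V_1 \Delta^2 V_1 \Delta V_1 \Delta^2 V_1 \\
& \quad + V_1 \Delta^2 V_1 \Delta^2 V_1 \Delta V_1 + V_1 \Delta^3 V_1 \Delta V_1 \Delta V_1 \\
& \quad + V_1 \Delta V_1 \Delta^3 V_2 + V_1 \Delta V_2 \Delta^3 V_1 + V_1 \Delta^2 V_1 \Delta^2 V_2 + V_1 \Delta^2 V_2 \Delta^2 V_1 \\
& \quad + V_1 \Delta^3 V_1 \Delta V_2 + V_1 \Delta^3 V_2 \Delta V_1 + V_2 \Delta V_1 \Delta^3 V_1 + V_2 \Delta^2 V_1 \Delta^2 V_1 \\
& \quad + V_2 \Delta^3 V_1 \Delta V_1 + V_2 \Delta^3 V_2 \big) P T_2 P T_2 \\
& - \big( V_1 \Delta V_1 \Delta V_1 \Delta^2 V_1 + V_1 \Delta V_1 \Delta^2 V_1 \Delta V_1 + V_1 \Delta^2 V_1 \Delta V_1 \Delta V_1 \\
& \quad + V_1 \Delta V_1 \Delta^2 V_2 + V_1 \Delta V_2 \Delta^2 V_1 + V_1 \Delta^2 V_1 \Delta V_2 + V_1 \Delta^2 V_2 \Delta V_1 + V_2 \Delta V_1 \Delta^2 V_1 + V_2 \Delta^2 V_1 \Delta V_1 \\
& \quad + V_2 \Delta^2 V_2 \big) P T_4 \\
& - V_1 \Delta^4 V_1 P T_2 P T_2 P T_2 \\
& + V_1 \Delta^3 V_1 P ( T_4 P T_2 + T_2 P T_4 ) \\
& - V_1 \Delta^2 V_1 P T_6 \big] P
\end{aligned}
\tag{B.8}
$$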
For the actual computation it is convenient to split up the vertices $V_1$ and $V_2$ into parts of
definite energy shift

$$ [H_0, V_{i,\delta E}] = \delta E \, V_{i,\delta E} , \qquad i = 1, 2 . \tag{B.9} $$

Since this implies

$$ \Delta_E V_{i,\delta E} = V_{i,\delta E} \Delta_{E-\delta E} \tag{B.10} $$

$$ P_E V_{i,\delta E} = V_{i,\delta E} P_{E-\delta E} \tag{B.11} $$

we are able to remove all projectors and propagators from the formulas above by moving
them to the right end and using $P_E P_F = \delta_{EF} P_E$.
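
As an illustration of this procedure, consider the first term of (B.5). The short derivation
below is a sketch under two assumptions of ours: that the propagator has the standard form
$\Delta_E = \sum_{F \neq E} P_F / (E - F)$, and that $V_1$ is written as a sum of pieces $V_{1,\delta E}$ of definite
energy shift. It uses only (B.10), (B.11) and the orthogonality of the projectors.

$$
\begin{aligned}
P_E V_{1,\delta E'} \Delta_E V_{1,\delta E} P_E
&= P_E V_{1,\delta E'} V_{1,\delta E} \Delta_{E-\delta E} P_E \\
&= \frac{1}{(E - \delta E) - E} \, P_E V_{1,\delta E'} V_{1,\delta E} P_E
 = - \frac{1}{\delta E} \, V_{1,\delta E'} V_{1,\delta E} P_{E - \delta E' - \delta E} P_E ,
 \qquad \delta E \neq 0 .
\end{aligned}
$$

The product of projectors vanishes unless $\delta E' = -\delta E$, and the piece with $\delta E = 0$ drops out
because $\Delta_E P_E = 0$. Summing over $E$ with $\sum_E P_E = 1$ then gives

$$ \sum_E P_E V_1 \Delta_E V_1 P_E = - \sum_{\delta E \neq 0} \frac{1}{\delta E} \, V_{1,-\delta E} V_{1,\delta E} , $$

in which no projector or propagator is left. The higher terms are treated in the same way,
with each propagator contributing the inverse of the total energy shift accumulated to its
right.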
The PWMT vertices (2.1) have components

$$ V_1 = \sum_{k=-3}^{3} V_{1,\frac{k}{3}M} \quad \text{and} \quad V_2 = \sum_{k=-4}^{4} V_{2,\frac{k}{3}M} \tag{B.12} $$

where pieces with positive and negative energy shift are related by $(V_{i,\delta E})^\dagger = V_{i,-\delta E}$. We now
list the vertices as they read after planar normal ordering, i.e. all planar self-contractions
of the vertices are already taken into account:



Vi,m = ~ ■^M^ktrb\b)b k + fyfrJcfitrOlelcb + iyMricetrOtibtOf,] (B.13) 

Vi,2M = ~ ^(7a)^tr^^t a t (B . 14) 

V l ,zM= + ^=e l3k trb\b)bi (B.15) 

V 2fi =^ + 13N tr a\a a + 77V tr b\h 

-IMt^UMM + sKF^[b\M[b)M - j^tviblb]}^) 

- ]Js tr [a\, a b ] [4 , Of,] + tr [4 , a a ] [aj , a*] ~ jjp tr [4, aj] [a a , a b ] 

- ^ti[al,b\][a a ,bi] - tx[a\, h][a a , b\] (B.16) 

V 2 ,m = ~ ^tr44 + j^tr[4,4][4> a b] 

-tjfrtr[a a ,bl][aa,bl] + ^ tr[4, b\][al h] (B.17) 

V 2 , 2M = - ^trfcjftt + ^tr[b\,b)][b\,b 3 ] 

- 4^tr[4,aJ][4,4] + ^tr[4,6l][a a ,fe|] (B.18) 

^a,3Jif = -4^rtr[4,6{][4,6{] (B.19) 

V2,m = -^^lb%b\,b^ (B.20) 
These are the building blocks for planar graphs (see Fig. P), e.g.

(B.21)

The arrows represent the oscillators in a trace in clockwise order; creation operators are
out-going arrows and annihilation operators are in-going ones, and the type of field
determines the coloring. In the actual computer implementation we have split the so(6)
index a according to (2.6) into six different colors.

C Higher charges 

We now give the next two charges ()3.15j) beyond the spin-chain Hamiltonian Q2 (|4.1|) . (|3.1U|) . 
They are not uniquely determined from demanding that they commute with each other
and with the Hamiltonian. The remaining freedom has been fixed in order to match the
eigenvalue formulas (D.2).

$$ Q_{3,2} = i(\{0,1\} - \{1,0\}) \tag{C.1} $$

$$ Q_{3,4} = -8i(\{0,1\} - \{1,0\}) + 2i(\{0,1,2\} - \{2,1,0\}) \tag{C.2} $$

$$
\begin{aligned}
Q_{3,6} = {} & 73i(\{0,1\} - \{1,0\}) - 28i(\{0,1,2\} - \{2,1,0\}) \\
& - 2i(\{0,1,3\} + \{0,2,3\} - \{0,3,2\} - \{1,0,3\}) \\
& + 5i(\{0,1,2,3\} - \{3,2,1,0\}) - i(\{0,2,1,3\} - \{1,0,3,2\}) \\
& - i(\{0,1,3,2\} - \{0,3,2,1\} + \{1,0,2,3\} - \{2,1,0,3\})
\end{aligned}
\tag{C.3}
$$

Q 3 ,8 = - (751 + a)i({0, 1} - {1,0}) + ±i(701 + 4a)({0, 1, 2} - {2, 1, 0}) 
+ if i({0, 1, 3} + {0, 2, 3} - {0, 3, 2} - {1, 0, 3}) 

- i(106 + a)({0, 1, 2, 3} - {3, 2, 1,0}) + 19i({0, 2, 1, 3} - {1, 0, 3, 2}) 
+ |i(41 - a)({0, 1, 3, 2} - {0, 3, 2, 1} + {1,0, 2, 3} - {2, 1,0, 3}) 

+ 2i({0, 1, 4} + {0, 3, 4} - {0, 4, 3} - {1, 0, 4}) - 4i({0, 1, 3, 4} - {1, 0, 4, 3}) 

- 4i({0, 1, 2, 4} + {0, 2, 3, 4} - {0, 4, 3, 2} - {2, 1, 0, 4}) 

+ 14i({0, 1, 2, 3, 4} - {4, 3, 2, 1, 0}) + ia({0, 3, 2, 1, 4} - {1,0, 2, 4, 3}) 
+ + 2a)({0, 2, 1, 3, 2} + {1, 0, 2, 1, 3} - {1,0, 3, 2, 1} - {2, 1, 0, 3, 2}) 

- |i(6 + a)({0, 1, 3, 2, 4} + {0, 2, 1, 3, 4} - {1, 0, 4, 3, 2} - {2, 1, 0, 4, 3}) 

- |i(6 - a)({0, 1, 2, 4, 3} - {0, 4, 3, 2, 1} + {1, 0, 2, 3, 4} - {3, 2, 1, 0, 4}) (C.4) 







Q*,2= - SO + IW- §({0,1} + {i,o» 

- |({0, 2, 1} + {1, 0, 2}) + §({0, 1, 2} + {2, 1, 0}) 



(C.5) 



Q M = 20{} - ±§±{0} + f ({0, 1} + {1, 0}) + |{0, 2} 

- f ({0, 1, 2} + {2, 1, 0}) + f ({0, 2, 1} + {1, 0, 2}) - |{1, 0, 2, 1} 
+ 2({0, 1, 2, 3} + {3, 2, 1, 0}) - |({0, 2, 1, 3} + {1,0, 3, 2}) 

- |({0, 1, 3, 2} + {0, 3, 2, 1} + {1, 0, 2, 3} + {2, 1, 0, 3}) (C.6) 

Q 4 ,6 = - 252{} + ±f±{0} - 154({0, 1} + {1, 0}) - if {0, 2} 

+ 2§2 ({0, ^ 2} + {2j x 0}) _ 244 ({0; 2) 1} + {lj Qj 2}) + 0> 2j 1} 

- f {0, 3} + 4({0, 1, 3} + {0, 2, 3} + {0, 3, 2} + {1, 0, 3}) 

- 38({0, 1, 2, 3} + {3, 2, 1, 0}) + f ({0, 2, 1, 3} + {1, 0, 3, 2}) 
+ f ({0, 1, 3, 2} + {0, 3, 2, 1} + {1, 0, 2, 3} + {2, 1, 0, 3}) 

+ |({0, 1, 4, 3} + {1, 0, 3, 4}) - |({0, 1, 3, 4} + {1, 0, 4, 3}) 
+ |({0, 2, 1, 4} + {0, 2, 4, 3} + {0, 3, 2, 4} + {1, 0, 2, 4}) 

- |({0, 1, 2, 4} + {0, 2, 3, 4} + {0, 4, 3, 2} + {2, 1, 0, 4}) 

+ 6({0, 1, 2, 3, 4} + {4, 3, 2, 1,0}) + |({0, 3, 2, 1, 4} + {1, 0, 2, 4, 3}) 
+ |({0, 2, 1, 4, 3} + {1, 0, 3, 2, 4}) + |({0, 1, 4, 3, 2} + {2, 1, 0, 3, 4}) 

- |({0, 2, 1, 3, 2} + {1, 0, 2, 1, 3} + {1,0, 3, 2, 1} + {2, 1, 0, 3, 2}) 

- 2({0, 1, 3, 2, 4} + {0, 2, 1, 3, 4} + {1,0, 4, 3, 2} + {2, 1,0, 4, 3}) 

- 2({0, 1, 2, 4, 3} + {0, 4, 3, 2, 1} + {1,0, 2, 3, 4} + {3, 2, 1, 0, 4}) (C.7) 

Q 4 ,s = + |(4763 + 2a){} - §(10349 + 16a){0} + ±(6139 + 22a)({0, 1} + {1, 0}) 
+ §(529 + 4a){0, 2} - ^{1, 0, 2, 1} 

- ±(4391 + 12a)({0, 1, 2} + {2, 1, 0}) + (961 - 2a)({0, 2, 1} + {1, 0, 2}) 

+ §(271 + 2a){0, 3} - ±(323 + 4a)({0, 1, 3} + {0, 2, 3} + {0, 3, 2} + {1, 0, 3}) 
+ ±(1793 + 6a)({0, 1, 2, 3} + {3, 2, 1, 0}) - ±(1163 + 16a)({0, 2, 1, 3} + {1, 0, 3, 2}) 
+ ±(59 + 2a)({0, 2, 1,3,2} +{1,0,2,1,3} + {1,0,3,2,1} +{2,1,0,3,2}) 
+ 12{0, 4} - f {0, 2, 4} - 8({0, 1, 4} + {0, 3, 4} + {0, 4, 3} + {1, 0, 4}) 

- ±(227 + 2a)({0, 2, 1, 4} + {0, 2, 4, 3} + {0, 3, 2, 4} + {1, 0, 2, 4}) 

- ±(955 - 4a)({0, 1, 3, 2} + {0, 3, 2, 1} + {1, 0, 2, 3} + {2, 1, 0, 3}) 
+ ±(275 + 2a)({0, 1, 2, 4} + {0, 2, 3, 4} + {0, 4, 3, 2} + {2, 1, 0, 4}) 

- ±(207 - 4a)({0, 1,4,3} +{1,0,3,4}) + ±(271- 4a)({0, 1,3, 4} +{1,0,4,3})+ 
+ 1(145 - a)({0, 1, 2, 4, 3} + {0, 4, 3, 2, 1} + {1, 0, 2, 3, 4} + {3, 2, 1, 0, 4})

- §(234 + a)({0, 1, 2, 3, 4} + {4, 3, 2, 1, 0}) 

+ |({0, 3, 2, 4, 3} + {1, 0, 2, 1,4}) - 1(41 - 2a)({0, 2, 1, 4, 3} + {1, 0, 3, 2, 4}) 

- f ({0, 3, 2, 1, 4} + {1, 0, 2, 4, 3}) - f ({0, 1, 4, 3, 2} + {2, 1, 0, 3, 4}) 

+ |(148 + a)({0, 1, 3, 2, 4} + {0, 2, 1, 3, 4} + {1, 0, 4, 3, 2} + {2, 1, 0, 4, 3}) 

- |(11 - 4a)({0, 2, 1, 3, 2,4} + {2, 1, 0, 4, 3, 2}) 

- f ({0, 1, 3, 2, 4, 3} + {1,0, 2, 1, 3, 4} + {1, 0, 4, 3, 2, 1} + {3, 2, 1, 0, 4, 3}) 
+ i({0, 3, 2, 1, 4, 3} + {1, 0, 2, 1,4, 3} + {1, 0, 3, 2, 1,4} + {1,0, 3, 2, 4, 3}) 
+ |({0, 1,4,5} + {1,0,5,4}) - |({0, 1, 5,4} + {1,0,4, 5}) 

- |({0, 2, 1, 5} + {0, 3, 5, 4} + {0, 4, 3, 5} + {1, 0, 2, 5}) 
+ |({0, 1, 2, 5} + {0, 3, 4, 5} + {0, 5, 4, 3} + {2, 1, 0, 5}) 

+ |({0, 2, 3, 5, 4} + {0, 3, 2, 1, 5} + {0, 4, 3, 2, 5} + {1, 0, 2, 3, 5}) 
+ |({0, 2, 1, 3, 5} + {0, 2, 4, 3, 5} + {0, 3, 2, 5, 4} + {1, 0, 3, 2, 5}) 
+ |({0, 1, 4, 3, 5} + {0, 2, 1, 4, 5} + {1, 0, 2, 5, 4} + {1, 0, 3, 5, 4}) 
+ |({0, 1, 3, 5, 4} + {0, 2, 1, 5, 4} + {1, 0, 2, 4, 5} + {1, 0, 4, 3, 5}) 
+ |({0, 1, 3, 2, 5} + {0, 2, 5, 4, 3} + {0, 3, 2, 4, 5} + {2, 1, 0, 3, 5}) 
+ |({0, 1, 2, 5, 4} + {0, 1, 5, 4, 3} + {1, 0, 3, 4, 5} + {2, 1, 0, 4, 5}) 

- 4({0, 1, 2, 4, 5} + {0, 1, 3, 4, 5} + {1,0, 5, 4, 3} + {2, 1, 0, 5, 4}) 
-4({0,1, 2,3,5} + {0,2,3,4,5} +{0,5,4,3,2} + {3,2,1,0,5}) 

+ |(2 - a)({0, 4, 3, 2, 1, 5} + {1, 0, 2, 3, 5, 4}) + |(2 + a)({0, 2, 1,4, 3, 5} + {1, 0, 3, 2, 5, 4}) 
+ |({0, 2, 1, 3, 5, 4} + {0, 3, 2, 1, 5, 4} + {1,0, 2, 4, 3, 5} + {1, 0, 4, 3, 2, 5}) 

- |(13 + 4a)({l, 0, 2, 1, 3, 2} + {2, 1, 0, 3, 2, 1}) - 1(5 + 4a)({0, 2, 1,4, 3, 2} + {2, 1, 0, 3, 2, 4}) 
+ 1(2 + a)({0, 1,4, 3, 2, 5} + {0, 3, 2, 1,4, 5} + {1, 0, 2, 5, 4, 3} + {2, 1, 0, 3, 5, 4}) 

+ 1(2 - a)({0, 1, 3, 2, 5, 4} + {0, 2, 1, 5, 4, 3} + {1, 0, 3, 2, 4, 5} + {2, 1, 0, 4, 3, 5}) 
+ f ({0, 1, 2, 3, 4, 5} + {5, 4, 3, 2, 1, 0}) - f ({0, 1, 3, 2, 4, 5} + {2, 1, 0, 5, 4, 3}) 
+ |({0, 1, 2, 5, 4, 3} + {0, 1, 5, 4, 3, 2} + {2, 1, 0, 3, 4, 5} + {3, 2, 1, 0, 4, 5}) 

- 1(18 + a)({0, 1, 2, 4, 3, 5} + {0, 2, 1, 3, 4, 5} + {1, 0, 5, 4, 3, 2} + {3, 2, 1,0, 5, 4}) 

- 1(18 - a)({0, 1, 2, 3, 5, 4} + {0, 5, 4, 3, 2, 1} + {1, 0, 2, 3, 4, 5} + {4, 3, 2, 1, 0, 5}) (C.8) 

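D Spin chain spectrum

In this appendix we list the eigenvalues $q_{2,3,4}$ of the charges $Q_{2,3,4}$ for spin chains of length
$L$ and magnon number $M$. The eigenvalues have been computed once by directly applying
the corresponding operators to explicit states, and a second time by solving the Bethe
equations for the momenta and using the following eigenvalue formulas

$$ q_i = \sum_{j=1}^{M} q_i(p_j) \tag{D.1} $$

with

$$
\begin{aligned}
q_2(p) = {} & 8 \sin^2(\tfrac{p}{2}) - 32 \Lambda_r \sin^4(\tfrac{p}{2}) \\
& + 256 \Lambda_r^2 \sin^6(\tfrac{p}{2}) - 2560 \Lambda_r^3 \sin^8(\tfrac{p}{2}) ,
\end{aligned}
\tag{D.2a}
$$

$$
\begin{aligned}
q_3(p) = {} & 8 \sin(p) \sin^2(\tfrac{p}{2}) - 64 \Lambda_r \sin(p) \sin^4(\tfrac{p}{2}) \\
& + 640 \Lambda_r^2 \sin(p) \sin^6(\tfrac{p}{2}) - 7168 \Lambda_r^3 \sin(p) \sin^8(\tfrac{p}{2}) ,
\end{aligned}
\tag{D.2b}
$$

$$
\begin{aligned}
q_4(p) = {} & \tfrac{32}{3} \sin(\tfrac{3p}{2}) \sin^3(\tfrac{p}{2}) - 128 \Lambda_r \sin(\tfrac{3p}{2}) \sin^5(\tfrac{p}{2}) \\
& + 1536 \Lambda_r^2 \sin(\tfrac{3p}{2}) \sin^7(\tfrac{p}{2}) - \tfrac{57344}{3} \Lambda_r^3 \sin(\tfrac{3p}{2}) \sin^9(\tfrac{p}{2}) .
\end{aligned}
\tag{D.2c}
$$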
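
The eigenvalue formulas (D.1), (D.2a) and (D.2b) are straightforward to evaluate numerically.
The Python sketch below does so for given momenta; the function names are ours. In the
usage example we assume the leading-order two-magnon momenta $p_{1,2} = \pm 2\pi n/(L-1)$, which
is a standard result not stated in the text, and set $\Lambda_r = 0$. At non-zero $\Lambda_r$ the momenta
themselves receive corrections from the Bethe equations, which are not modelled here.

```python
import math


def q2(p, lam):
    """Single-magnon eigenvalue of Q_2, eq. (D.2a); lam is Lambda_r."""
    s = math.sin(p / 2) ** 2
    return 8 * s - 32 * lam * s**2 + 256 * lam**2 * s**3 - 2560 * lam**3 * s**4


def q3(p, lam):
    """Single-magnon eigenvalue of Q_3, eq. (D.2b)."""
    s = math.sin(p / 2) ** 2
    series = 8 * s - 64 * lam * s**2 + 640 * lam**2 * s**3 - 7168 * lam**3 * s**4
    return math.sin(p) * series


def total_charge(q, momenta, lam):
    """Sum of single-magnon contributions, eq. (D.1)."""
    return sum(q(p, lam) for p in momenta)


# Leading order (Lambda_r = 0) for two magnons with opposite momenta.
for length in (4, 5):
    p = 2 * math.pi / (length - 1)     # assumed leading-order momentum, n = 1
    print(length, round(total_charge(q2, [p, -p], 0.0), 10))
```

The leading-order values obtained this way, 12 for $(L, M) = (4, 2)$ and 8 for $(L, M) = (5, 2)$,
agree with the first entries of the $q_2$ column in table 3. Since $q_3(p)$ is odd in $p$, its total
vanishes for any pair of opposite momenta.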

These formulas are in accordance with the general all-loop expressions for qi given in [HI 

must 

The stars "***" in the tables below indicate that the higher-loop eigenvalues could not be
determined due to wrapping issues. The value $q_{2,8} = -\frac{12771}{4}\Lambda_r^3$ for the state $(L, M) = (4, 2)$
is also subject to wrapping, but was determined in a separate calculation (2.31). If we used
the eigenvalue formula (D.2a) we would get the wrong value $q_{2,8} = -3054\Lambda_r^3$.

"Singular" means that the Bethe equations become singular for the lowest order solution 
of the Bethe momenta pi. In these cases one should use the inhomogeneous Bethe ansatz 
as in [S]. 

Unpaired states are annihilated by $Q_3$, hence $q_3 = 0$. In these cases we give the parity
of the corresponding multiplet in the column for $q_3$. If $Q_3$ does not annihilate a state,
then it maps the state to the partner state of opposite parity. As the generic $Q_3$ and the
parity operator do not commute (they anti-commute), we cannot diagonalize both operators
simultaneously. In these cases we print the pair of eigenvalues of $Q_3$, which are always the
negative of each other. All Bethe momenta are also opposite to each other for a degenerate
pair of states.



L 


M 


$q_2$


$q_3$


$q_4$


Bethe momenta 


4 


2 


12 


+ 











-48A r 




* * * 


P2 = — Pl 






+336A^ 












12771 a3 
4 lv r 








5 


2 


8 

-24A r 

+136A^ 

-1024A^ 




16 

3 

-32A r 

* * * 


Pl = § 
P2 = -Pl 



Table 3: Eigenvalues of $Q_2, Q_3, Q_4$



We are grateful to V. Dippel for providing us some of these solutions from her unpublished work. 
L


M 


$q_2$


$q_3$


$q_4$


Bethe momenta


6 


2 


2(5 - %/5) 

-(34 - 10a/5)At. 

+ 1(1170 - 414 V / 5)A? 

- 1(10695 - 4134v / 5)A3 


+ 


-f(i-Vs) 

+12(5 + 3-x/5)A r 
-6(127 + 65\/5)A2 
* * * 


Pi = T 
P2 = -Pi 


2(5 + a/5) 

-(34 + 10a/5)A,. 

+ i(1170 + 414\/5)A 2 

- 1 (10695 + 4134X/5) A3 


+ 


-f (1W5) 
+12(5 - 3i/5)A r 
—6(127 — 65v / 5)Aj; 
* * * 


Pi = T 

P2 = -Pi 


3 


12 

-36A r 

+252A 2 . 

-2484A3 


- 


-12 

+144A r 

-1620A^ 

* * * 


"singular" 


7 


2 


4 

-6A r 

+ 37A2 

335 A 3 
4 il r 


- 


8 
3 

-10A r 
_|_81 A 2 

759 a 3 


Pl = f 
P2 = -Pi 


12 

-42A r 


- 




-18A r 

+ 13311 A 3 


Pl = 4f 

P2 = -Pi 


3 


10 

-30A r 
_i_9nn A 2 

3565 A 3 
2 r 


±8VTEA r 

±668 v^A^ 


1U 

3 

+50A r 

K7A A 2 
— 1 UiV r 

+ 13245 A? 


pi = ±1.16 + 0.93? 
p 2 = ±1.16 + 0.93? 

P3 — — Pi — P2 


8 


2 


3.01 

-3.32A r 
+7. 66 A 2 
-27.30A» 


+ 


1.70 

-4.83A r 

+14.81A 2 

-53.90A 3 . 


w = ¥ 

P2 = -Pi 


9.78 

-29.22A r 

1 1 P.1 ft/1 A 2 
+ 10 f .04A r 

-1256. 15A^ 


+ 


4.42 

-26.67A r 

1 1 *7ft 1 O A 2 
+ 1 / D.lZiV r 

-1204.33A 3 


Pl = ^ 
P2 = -Pi 


15.21 
-59.45A r 
+45o.70A^ 
-4430.55A3 


+ 


-15.46 
+187.49A r 
— 2190. 93A£ 
+26640.90A 3 


Pl = f 
P2 = -Pi 


3 


8 

-20A r 

+112A 2 . 

-842A 3 


+4 

±28A r 

+210A 2 . 

±1743A3 


4 

3 

-4A r 
+16A 2 

_52 A 3 


pi = ±0.96 ± 0.59i 
p 2 = ±0.96 + 0.59i 
P3 = -Pi - P2 


12 

-36A r 

+264A 2 . 

-2592A 3 




-12 
+132A r 
-1536A 2 
+19056A 3 


"singular" 


4 


6.49 

-7.56A r 

+10.22A 2 . 

+10.25A3 


+ 


-0.93 
+10.15A r 
-55.32A 2 
+285. 14A 3 


pi = 1.13 + 0.87i 
p 2 = 1.13 - 0.87i 
P3 = -Pi 
P4 = -P2 


10.90 

-31.76A r 
+249.76A 2 . 
-2538. 27A^ 


+ 


-11.67 
+129.44A r 
-1528. 91A 2 
+19065. 27A 3 


pi = 2.98 
p 2 = -1.07i 
P3 = -Pi 
P4 = -P2 


22.60 
-88.68A r 
+636.03A 2 . 
-5933.98A3 


+ 


-8.73 
+148.40A r 
-1951. 77A 2 
+25244.26A 3 


pi = 2.63 
p 2 = 1-52 
P3 = -Pi 

P4 = -P2 



Table 3: Eigenvalues of $Q_2, Q_3, Q_4$ (cont.)
L


M 


$q_2$


$q_3$


$q_4$


Bethe momenta 


9 


2 


4(2 - v^2) 

-(26 - 17\/2)A r 

+ |(179 - 993v^2)A? 

(51376 - 36103\/2)A3 




+10(4 - 3V?)K 

-^(ss-esv^A^ 

+ ^(260000 - 184411v / 2)A 3 . 


Pi = 7 
P2 = -Pi 


8 

-20A r 
+98A2 
-639A^! 




~3~ 

-32A r 
+204A2 
4192 »3 
3 r 


PI = f 
P2 = -PI 


4(2 + V2) 

-(26 + 17\/2)A r 

+ |(179 + 993V^)A2 

(51376 + 36103^^3 




-|(1 + V2) 

+10(4 + 3\/2)A r 

-^(88 + 63\/2)A2 

+ ^(260000 - 184411a/2)A3 


Pl = ^ 
P2 = -PI 


3 


6.45 

-13.18A r 
+60.73A2 
-378.34A 3 , 


±3.44 
+20.01A r 
±124.66A2 
+859. 22A 3 - 


2.67 

-15.38A r 

+103.80A2 

-748.03A 3 - 


pi = ^0.83 + 0.43i 
p 2 = ^0.83 ± 0.43i 
P3 = -Pi - P2 


11.04 

— 33.50 A r 

+231.93A2 

-2149.91A 3 , 


±2.99 
+23.92A r 
±223.38A2 
+2345. 23A 3 , 


-6.89 
+83.02A r 
-952.18A2 
+11361. 10A 3 - 


pi = +1.28+ 1.26i 
P2 = +1.28 ± 1.26i 
P3 = -Pl - P2 


16.51 
-55.32A r 
+383.33A2 
-3494. 75 A^ 


±8.61 
+53.35A r 
+414.82A2 
+3879. 59A 3 , 


-9.12 
+120.35A r 
-1471. 62A2 
+18242.92A 3 - 


pi = ±2.98 
P2 = ±1-15 

P3 = -Pl - P2 


4 


10 

-30A r 
+220A2 
4185 A a 


+V11 
±ffv / TTA r 

± 9 T^ 1 v^TA« 


22 

3 

+86A r 
-982A2 
+ 23 l n Af 


pi = ±2.63 

p 2 = T 0.53 ± 0.88i 

P3 = +1-77 

P4 = -Pl — PI- P3 


4(3 - x/3) 

-18(2 - \/3)A r 

+ |(456 - 255v / 3)A2 

-±(7908 - 4541 \/3) A 3 . 






+ (24 - 12\/3)A r 
-(360 - 198\/3)A2 
+ ±(9540 - 5417v%)A3 


pi = 0.94 + 0.55i 
P2 = 0.94 - 0.55i 
P3 = -Pi 

P4 = ~P2 


4(3 + V3) 

-18(2 + \/3)A r 

+ i(456 + 255 v / 3)A2 

-|(7908 + 4541 V / 3)A 3 . 






+ (24 - 12V3)A r 
-(360 - 198\/3)A2 
+ ±(9540 + 5417 v %)A 3 , 


pi = 2.28 
P2 = 1-28 
P3 = -Pi 

P4 = -P2 



Table 3: Eigenvalues of $Q_2, Q_3, Q_4$ (cont.)